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In the cumulative experience with 
measures of individual differences 
over the past 50 years, tests have 
been accepted as valid or discarded 
as invalid by research experiences of 
many sorts. The criteria suggested in 
this paper are all to be found in such 
cumulative evaluations, as well as in 
the recent discussions of validity. 
These criteria are clarified and imple- 
mented when considered jointly in 
the context of a multitrait-multi- 
method matrix. Aspects of the valida- 
tional process receiving particular 
emphasis are these: 

1. Validation is typically conver- 
gent, a confirmation by independent 
measurement procedures. Independ- 
ence of methods is a common denom- 
inator among the major types of 
validity (excepting content validity) 


1The new data analyses reported in this 
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Graduate School of Northwestern University 
and by the Department of Psychology of the 
University of Chicago. We are also indebted 
to numerous colleagues for their thoughtful 
criticisms and encouragement of an earlier 
draft of this paper, especially Benjamin S. 
Bloom, R. Darrell Bock, Desmond S. Cart- 
wright, Loren J. Chapman, Lee J. Cronbach, 
Carl P. Duncan, Lyle V. Jones, Joe Kamiya, 
Wilbur L. Layton, Jane Loevinger, Paul E. 
Meehl, Marshall H. Segall, Thornton B. Roby, 
Robert C. Tryon, Michael Wertheimer, and 
Robert F. Winch. 


insofar as they are to be distinguished 
from reliability. 

2. For the justification of novel 
trait measures, for the validation of 
test interpretation, or for the estab- 
lishment of construct validity, dis- 
criminant validation as well as con- 
vergent validation is required. Tests 
can be invalidated by too high cor- 
relations with other tests from which 
they were intended to differ. 

3. Each test or task employed for 
measurement purposes is a f(rait- 
method unit, a union of a particular 
trait content with measurement pro- 
cedures not specific to that content. 
The systematic variance among test 
scores can be due to responses to the 
measurement features as well as re- 
sponses to the trait content. 

4. In order to examine discrim- 
inant validity, and in order to esti- 
mate the relative contributions of 
trait and method variance, more than 
one trait as well as more than one 
method must be employed in the vali- 
dation process. In many instances it 
will be convenient to achieve this 
through a_  multitrait-multimethod 
matrix. Such a matrix presents all of 
the intercorrelations resulting when 
each of several traits is measured by 
each of several methods. 

To illustrate the suggested valida- 
tional process, a synthetic example is 
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TABLE 1 





A SYNTHETIC MULTITRAIT-MULTIMETHOD MATRIX 








Traits f 


Method 1 


Method 2 


Method 3 


C; 


Method 2 


Method 3 





Cy As B; Cy 





Note.—The validity diagonals are the three sets of italicized values. The reliability diagonals are the three sets 
of values in parentheses. Each heterotrait-monomethod triangle is enclosed by a solid. line. Each heterotrait- 


heteromethod triangle is enclosed by a broken line. 


presented in Table 1. This illustra- 
tion involves three different traits, 
each measured by three methods, 
generating nine separate variables. It 
will be convenient to have labels for 
various regions of the matrix, and 
such have been provided in Table 1. 
The reliabilities will be spoken of in 
terms of three reliability diagonals, 
one for each method. The reliabilities 
could also be designated as the mono- 
trait-monomethod values. Adjacent 
to each reliability diagonal is the 
heterotrait-monomethod triangle. The 
reliability diagonal and the adjacent 
heterotrait-monomethod triangle 
make up a monomethod block. A heter- 
omethod block is made up of a validity 
diagonal (which could also be desig- 
nated as monotrait-heteromethod 
values) and the two heterotrait-hetero- 
method triangles lying on each side of 
it. Note that these two heterotrait- 


heteromethod triangies are not iden- 
tical. 

In terms of this diagram, four as- 
pects bear upon the question of valid- 


ity. In the first place, the entries in 
the validity diagonal should be sig- 
nificantly different from zero and 
sufficiently large to encourage further 
examination of validity. This re- 
quirement is evidence of convergent 
validity. Second, a validity diagonal 
value should be higher than the val- 
ues lying in its column and row in the 
heterotrait-heteromethod _ triangles. 
That is, a validity va'ue for a variable 
should be higher thar the correlations 
obtained between that variable and 
any other variable having neither 
trait nor method in:common. This 
requirement may seem so minimal 
and so obvious as to not need stating, 
yet an inspection of the literature 
shows that it is frequently not met, 
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and may not be met even when the 
validity coefficients are of substantial 
size. In Table 1, all of the validity 
values meet this requirement. A 
third common-sense desideratum is 
that a variable correlate higher with 
an independent effort to measure the 
same trait than with measures de- 
signed to get at different traits which 
happen to employ the same method. 
For a given variable, this involves 
comparing its values in the validity 
diagonals with its values in the heter- 
otrait-monomethod triangles. For 
variables A;, B:, and Ci, this require- 
ment is met to some degree. For the 
other variables, Ag, A; etc., it is not 
met and this is probably typical of 
the usual case in individual differ- 
ences research, as will be discussed in 
what follows. A fourth desideratum 


is that the same pattern of trait in- 
terrelationship be shown in all of the 
heterotrait triangles of both the mon- 
omethod and heteromethod blocks. 
The hypothetical data in 


Table 1 
meet this requirement to a very 
marked degree, in spite of the dif- 
ferent general levels of correlation in- 
volved in the several heterotrait tri- 
angles. The last three criteria pro- 
vide evidence for discriminant va- 
lidity. 

Before examining the multitrait- 
multimethod matrices available in 
the literature, some explication and 
justification of this complex of re- 
quirements seems in order. 

Convergence of independent methods: 
the distinction between reliability and 
validity. Both reliability and validity 
concepts require that agreement be- 
tween measures be demonstrated. A 
common denominator which most 
validity concepts share in contradis- 
tinction to reliability is that this 
agreement represent the convergence 
of independent approaches. The con- 
cept of independence is indicated by 


such phrases as “external variable,” 
“criterion performance,” ‘‘behavioral 
criterion’’ (American Psychological 
Association, 1954, pp. 13-15) used in 
connection with concurrent and pre- 
dictive validity. For construct valid- 
ity it has been stated thus: ‘‘ Numer- 
ous successful predictions dealing 
with phenotypically diverse ‘criteria’ 
give greater weight to the claim of 
construct validity than do... pre- 
dictions involving very similar be- 
havior’ (Cronbach & Meehl, 1955, p. 
295). The importance of independ- 
ence recurs in most discussions of 
proof. For example, Ayer, discussing 
a historian’s belief about a past 
event, says “if these sources are 
numerous and independent, and if 
they agree with one another, he will 
be reasonably confident that their ac- 
count of the matter is correct’”’ (Ayer, 
1954, p. 39). In discussing the man- 
ner in which abstract scientific con- 
cepts are tied to operations, Feigl 
speaks of their being “‘fixed”’ by “‘tri- 
angulation in logical space’’ (Feigl, 
1958, p. 401). 

Independence is, of course, a mat- 
ter of degree, and in this sense, relia- 
bility and validity can be seen as re- 
gions on a continuum. (Cf. Thur- 
stone, 1937, pp. 102-103.) Reliability 
is the agreement between two efforts 
to measure the same trait through 
maximally similar methods. Validity 
is represented in the agreement be- 
tween two attempts to measure the 
same trait through maximally differ- 
ent methods. A split-half reliability 
is a little more like a validity coeffi- 
cient than is an immediate test-retest 
reliability, for the items are not quite 
identical. A correlation between 
dissimilar subtests is probably a reli- 
ability measure, but is still closer to 
the region called validity. 

Some evaluation of validity can 
take place even if the two methods 
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are not entirely independent. In 
Table 1, for example, it is possible 
that Methods 1 and 2 are not en- 
tirely independent. If underlying 
Traits A and B are entirely inde- 
pendent, then the .10 minimum cor- 
relation in the heterotrait-hetero- 
method triangles may reflect method 
covariance. What if the overlap of 
method variance were higher? All 
correlations in the heteromethod 
block would then be elevated, includ- 
ing the validity diagonal. The hetero- 
method block involving Methods 2 
and 3 in Table 1 illustrates this. The 
degree of elevation of the validity 
diagonal above the heterotrait-heter- 
omethod triangles remains compa- 
rable and relative validity can still be 
evaluated. The interpretation of the 
validity diagonal in an absolute fash- 
ion requires the fortunate coincidence 
of both an independence of traits 
and an independence of methods, 
represented by zero values in the 
heterotrait-heteromethod _ triangles. 
But zero values could also occur 
through a combination of negative 
correlation between traits and posi- 
tive correlation between methods, or 
the reverse. In practice, perhaps all 
that can be hoped for is evidence for 
relative validity, that is, for common 
variance specific to a trait, above and 
beyond shared method variance. 
Discriminant validation. While the 
usual reason for the judgment of in- 
validity is low correlations in the 
validity diagonal (e.g., the Downey 
Will-Temperament Test [Symonds, 
1931, p. 337ff]) tests have also been 
invalidated because of too high cor- 
relations with other tests purporting 
to measure different things. The 
classic case of the social intelligence 
tests is a case in point. (See below 
and also [Strang, 1930; R. Thorndike, 
1936].) Such invalidation occurs 
when values in the heterotrait-hetero- 


method triangles are as high as those 
in the validity diagonal, or even 
where within a monomethod block, 
the heterotrait values are as high as 
the reliabilities. Loevinger, Gleser, 
and DuBois (1953) have emphasized 
this requirement in the development 
of maximally discriminating subtests. 

When a dimension of personality is 
hypothesized, when a construct is 
proposed, the proponent invariably 
has in mind distinctions between the 
new dimension and other constructs 
already in use. One cannot define 
without implying distinctions, and 
the verification of these distinctions 
is an important part of the valida- 
tional process. In discussions of con- 
struct validity, it has been expressed 
in such terms as ‘‘from this point of 
view, a low correlation with athletic 
ability may be just as important and 
encouraging as a high correlation 
with reading comprehension” (APA, 
1954, p. 17). 

The test as a trait-method unit. In 
any given psychological measuring 
device, there are certain features or 
stimuli introduced specifically to 
represent the trait that it is intended 
to measure. There are other features 
which are characteristic of the 
method being employed, features 
which could also be present in efforts 
to measure other quite different 
traits. The test, or rating scale, or 
other device, almost inevitably elicits 
systematic variance in response due 
to both groups of features. To the ex- 
tent that irrelevant method variance 
contributes to the scores obtained, 
these scores are invalid. 

This source of invalidity was first 
noted in the “halo effects’ found in 
ratings (Thorndike, 1920). Studies 
of individual differences among lab- 
oratory animals resulted in the recog- 
nition of ‘‘apparatus factors,’’ usu- 
ally more dominant than psychologi- 
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cal process factors (Tryon, 1942). 
For paper-and-pencil tests, methods 
variance has been noted under such 
terms as ‘‘test-form factors’’ (Ver- 
non: 1957, 1958) and “‘response sets”’ 
(Cronbach: 1946, 1950; Lorge, 1937). 
Cronbach has stated the point partic- 
ularly clearly: ‘‘The assumption is 
generally made...that what the 
test measures is determined by the 
content of the items. Yet the final 
score .. . is a composite of effects re- 
sulting from the content of the item 
and effects resulting from the form 
of the item used’’ (Cronbach, 1946, 
p. 475). ‘‘Response sets always lower 
the logical validity of a test.... 
Response sets interfere with infer- 
ences from test data’ (p. 484). 

While E. L. Thorndike (1920) was 
willing to allege the presence of halo 
effects by comparing the high ob- 
tained correlations with common 
sense notions of what they ought to 
be (e.g., it was unreasonable that a 
teacher’s intelligence and voice qual- 
ity should correlate .63) and while 
much of the evidence of response set 
variance is of the same order, the 
clear-cut demonstration of the pres- 
ence of method variance requires 
both several traits and several meth- 
ods. Otherwise, high correlations be- 
tween tests might be explained as due 
either to basic trait similarity or to 
shared method variance. In the 
multitrait-multimethod matrix, the 
presence of method variance is indi- 
cated by the difference in level of cor- 
relation between the parallel values 
of the monomethod block and the 
heteromethod blocks, assuming com- 
parable reliabilities among all tests. 
Thus the contribution of method var- 
iance in Test A; of Table 1 is indi- 
cated by the elevation of 74,5, above 
Ta,By, 1.e., the difference between .51 
and .22, etc. 

The distinction between trait and 


method is of course relative to the 
test constructor’s intent. What is an 
unwanted response set for one tester 
may be a trait for another who wishes 
to measure acquiescence, willingness 
to take an extreme stand, or tendency 
to attribute socially desirable attri- 
butes to oneself (Cronbach: 1946, 
1950; Edwards, 1957; Lorge, 1937). 


MULTITRAIT-MULTIMETHOD Ma- 
TRICES IN THE LITERATURE 


Multitrait-multimethod matrices 
are rare in the test and measurement 
literature. Most frequent are two 
types of fragment: two methods and 
one trait (single isolated values from 
the validity diagonal, perhaps ac- 
companied by a reliability or two), 
and  heterotrait-monomethod _ tri- 
angles. Either type of fragment is 
apt to disguise the inadequacy of our 
present measurement efforts, particu- 
larly in failing to call attention to the 
preponderant strength of methods 
variance. The evidence of test valid- 
ity to be presented here is probably 
poorer than most psychologists would 
have expected. 

One of the earliest matrices of this 
kind was provided by Kelley and 
Krey in 1934. Peer judgments by 
students provided one method, scores 
on a word-association test the other. 
Table 2 presents the data for the four 
most valid traits of the eight he em- 
ployed. The picture is one of strong 
method factors, particularly among 
the peer ratings, and almost total in- 
validity. For only one of the eight 
measures, School Drive, is the value 
in the validity diagonal (.16!) higher 
than all of the heterotrait-hetero- 
method values. The absence of dis- 
criminant validity is further indi- 
cated by the tendency of the values 
in the monomethod triangles to ap- 
proximate the reliabilities. 

An early illustration from the ani- 








86 D. T. CAMPBELL AND D. W. FISKE 
TABLE 2 
PERSONALITY TRAITS OF SCHOOL CHILDREN FROM KELLEY’s STUDY 
( meats) 














Peer Ratings Association Test 








Ai Bi Cc D, Ag B2 Ce D: 

Peer Ratings 

Courtesy Ai (.82) 

Honesty Bi 74 (.80) 

Poise Cc .63 65 (.74) 

School Drive D; .76 .78 65 (.89) 
Association Test 

Courtesy Ae .13 14 .10 .14 (.28) 

Honesty Be .06 12 .16 .08 a lCU ECS 

Poise & 01 .08 .10 ~~ 02 19 = .37.— (.42) 

School Drive D2 as A .14 .16 ae Re .18 (.36) 








mal literature comes from Anderson’s 
(1937) study of drives. Table 3 pre- 
sents a sample of his data. Once 
again, the highest correlations are 
found among different constructs 
from the same method, showing the 
dominance of apparatus or method 
factors so typical of the whole field of 
individual differences. The validity 
diagonal for hunger is higher than the 


measure was pre-sex-opportunity, the 
activity wheel  post-opportunity. 
Note that the high general level 
of heterotrait-heteromethod values 
could be due either to correlation of 
methods variance between the two 
methods, or to correlated trait vari- 
ance. On a priori grounds, however, 
the methods would seem about as in- 
dependent as one would be likely to 





heteroconstruct-heteromethod _ val- 
ues. The diagonal value for sex has 
not been italicized as a_ validity 
coefficient since the obstruction box 


achieve. The predominance of an ap- 
paratus factor for the ‘activity wheel 
is evident from the fact that the cor- 
relation between hunger and thirst 


TABLE 3 


MEASURES OF DRIVES FROM ANDERSON’S DATA 
(N=50) 





Obstruction Box Activity Wheel 











Ai Bi Cy Ay : Be C2 

Obstruction Box 

Hunger Ay (.58) 

Thirst B, 54 b> 3 

Sex Cc, .46 .70 ( ) 
Activity Wheel 

Hunger Ao 48 | .37 (.83) 

Thirst B: .35 se 43 .87 f (.92) 

Post Sex C. 31 af 44 .69 .78 ¢ ) 








Note. —Empty parentheses appear in this and subsequent tables where no appropriate re! ability estimates are 
reported in the original paper. 
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TABLE 4 


SocIAL INTELLIGENCE AND MENTAL ALERTNESS SUBTEST INTERCORRELATIONS FROM 
THORNDIKE’S DATA 


(N=750) 





Compre- 


Memory hension 





Av Bi Aa Bz As Bs 





Memory 


Social Intelligence (Memory for Names & Faces) A; ( ) 


Mental Alertness (Learning Ability) 


Comprehension 
Social Intelligence (Sense of Humor) 
Mental Alertness (Comprehension ) 


Vocabulary 
Social Intelligence (Recog. of Mental State) 
Mental Alertness (Vocabulary) 


(.87) is of the same magnitude as 
their test-retest reliabilities (.83 and 
.92 respectively). 

R. L. Thorndike’s study (1936) of 
the validity of the George Washing- 
ton Social Intelligence Test is the 
classic instance of invalidation by 
high correlation between traits. It in- 
volved computing all of the intercor- 
relations among five subscales of the 
Social Intelligence Test and five sub- 
scales of the George Washington 
Mental Alertness Test. The model of 
the present paper would demand that 
each of the traits, social intelligence 
and mental alertness, be measured by 
at least two methods. While this full 
symmetry was not intended in the 
study, it can be so interpreted with- 
out too much distortion. For both 
traits, there were subtests employing 
acquisition of knowledge during the 
testing period (i.e., learning or mem- 
ory), tests involving comprehension 
of prose passages, and tests that in- 
volved a definitional activity. Table 4 
shows six of Thorndike’s 10 variables 
arranged as a multitrait-multimethod 
matrix. If the three subtests of the 
Social Intelligence Test are viewed 


m ot 3 


A; .30 .31 ( ) 
» © & 


48 ( ) 
Ag £23 2 tt. a ( _) 
48 47 ( ) 


B; .30 .58 § .40 





as three methods of measuring social 
intelligence, then their intercorrela- 
tions (.30, .23, and .31) represent 
validities that are not only lower than 
their corresponding monomethod val- 
ues, but also lower than the hetero- 
trait-heteromethod correlations, pro- 
viding a picture which totally fails to 
establish social intelligence as a sep- 
arate dimension. The Mental Alert- 
ness validity diagonals (.38, .58, and 
.48) equal or exceed the monomethod 
values in two out of three cases, and 
exceed all heterotrait-heteromethod 
control values. These results illus- 
trate the general conclusions reached 
by Thorndike in his factor analysis of 
the whole 10 X10 matrix. 

The data of Table 4 could be used 
to validate specific forms of cognitive 
functioning, as measured by the dif- 
ferent ‘‘methods’” represented by 
usual intelligence test content on the 
one hand and social content on the 
other. Table 5 rearranges the 15 val- 
ues for this purpose. The mono- 
method values and the validity diag- 
onals exchange places, while the 
heterotrait-heteromethod control co- 
efficients are the same in both tables. 














88 D. T. CAMPBELL AND D. W. FISKE 


TABLE 5 


Memory, COMPREHENSION, AND VOCABULARY MEASURED WITH 
SOCIAL AND ABSTRACT CONTENT 











Social Content Abstract Content 








A Bi G. As B CG; 

Social Content 

Memory (Memory for Names and Faces) Ai eu 

Comprehension (Sense of Humor) Bi et 3 

Vocabulary (Recognition of Mental State) Cc, ae 2a { ) 
Abstract Content 

Memory (Learning Ability) Aa a M .': € 4 

Comprehension B2 29 .48 .35 met ) 

Vocabulary C; .30 .40 





47 oo .48 ( ) 





As judged against these latter values, 
comprehension (.48) and vocabulary 
(.47), but not memory (.31), show 
some specific validity. This trans- 
mutability of the validation matrix 
argues for the comparisons within the 
heteromethod block as the most gen- 
erally relevant validation data, and 
illustrates the potential interchange- 
ability of trait and method com- 
ponents. 

Some of the correlations in Chi’s 
(1937) prodigious study of halo effect 
in ratings are appropriate to a multi- 
trait-multimethod matrix in which 
each rater might be regarded as rep- 
resenting a different method. While 
the published report does not make 
these available in detail because it 
employs averaged values, it is appar- 
ent from a comparison of his Tables 
IV and VIII that the ratings gen- 
erally failed to meet the requirement 
that ratings of the same trait by dif- 
ferent raters should correlate higher 
than ratings of different traits by the 
same rater. Validity is shown to the 
extent that of the correlations in the 
heteromethod block, those in the 
validity diagonal are higher than the 
average heteromethod-heterotrait 
values. 

A conspicuously unsuccessful mul- 


titrait-multimethod matrix is pro- 
vided by Campbell (1953, 1956) for 
rating of the leadership behavior of 
officers by themselves and by their 
subordinates. Only one of 11 var- 
iables (Recognition Behavior) met 
the requirement of providing a valid- 
ity diagonal value higher than any of 
the heterotrait-heteromethod values, 
that validity being .29. For none of 
the variables were ‘the validities 
higher than heterotra:t-monomethod 
values. 

A study of attitudes toward au- 
thority and nonauthority figures by 
Burwen and Campbell (1957) con- 
tains a complex multitrait-multi- 
method matrix, one symmetrical ex- 
cerpt from which is shown in Table 6. 
Method variance was strong for most 
of the procedures in this study. 
Where validity was found, it was 
primarily at the level of validity 
diagonal values higher than hetero- 
trait-heteromethod values. As il- 
lustrated in Table 6, attitude toward 
father showed this kind of validity, as 
did attitude toward peers to a lesser 
degree. Attitude toward boss showed 
no validity. There was no evidence 
of a generalized attitude toward 


authority which would:include father 
and boss, although such values as the 
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TABLE 6 
ATTITUDES TOWARD FATHER, Boss, AND PEER, AS MEASURED BY 
INTERVIEW AND CHECK-LIsT OF DESCRIPTIVE TRAITS 

















Interview Trait Check-List 
Ai B; C; Ao Bz C; 
Interview 
(N=57) 
Father Ai ( ) 
Boss B2 64 ( ) 
Peer Cc, .65 .76 ( ) 
Trait Check-List 
(N=155) 
Father Az 40 .08 .09 (.24) 
Boss Be .19 —.10 — .03 .23 (.34) 
Peer C, .27 il .23 21 45 (.55) 


.64 correlation between father and lustrating the assessment of two 
boss as measured by interview might _ traits by four different methods. For 
have seemed to confirm the hypothe- all measures but one, the highest cor- 
sis had they been encountered in iso- relation is the apparatus one, i.e., 
lation. with the other trait measured by the 

Borgatta (1954) has provided a same method rather than with the 
complex multimethod study from same trait measured by a different 
which can be extracted Table 7, il- method. Neither of the traits finds 








TABLE 7 
MULTIPLE MEASUREMENT OF Two SOCIOMETRIC TRAITS 
(N=125) 
Sociometric Observation 
by Others by Self Group In- Role 


teraction Playing 








A: Bi A: Bz A; Bs A, By 





Sociometric by Others 


Popularity Ay ( ) 
Expansiveness B, “2% 263 

Sociometric by Self 
Popularity As AP AR.) 
Expansiveness Be 07 .08 = .32 ) 


Observation of Group Interaction 
Popularity As 


J .18 26 .i1 ( ) 
Expansiveness B; 


5 
.' <® ~<a s& . ft ae 


m %& 


Observation of Role Playing 
Popularity Ay .24 = .14 18 .01 66 .58 ( ) 
Expansiveness By , ae ee oe 6 7 3 ¢ ) 
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any consistent validation by the re- 
quirement that the validity diagonals 
exceed the heterotrait-heteromethod 
control values. As a most minimal 
requirement, it might be asked if the 
sum of the two values in the validity 
diagonal exceeds the sum of the two 
control values, providing a compari- 
son in which differences in reliability 
or communality are roughly par- 
tialled out. This condition is achieved 
at the purely chance level of three 
times in the six tetrads. This matrix 
provides an interesting range of 
methodological independence. The 
two “Sociometric by Others’’ meas- 
ures, while representing the judg- 
ments of the same set of fellow par- 
ticipants, come from distinct tasks: 
Popularity is based upon each par- 
ticipant’s expression of his own 
friendship preferences, while Ex- 
pansiveness is based upon each par- 
ticipant’s guesses as to the other par- 
ticipant’s choices, from which has 
been computed each participant's 
reputation for liking lots of other per- 
sons, i.e., being ‘‘expansive.’’ In line 
with this considerable independence, 
the evidence for-a method factor is 
relatively low in comparison with the 
observational procedures. Similarly, 
the two ‘‘Sociometric by Self’’ meas- 
ures represent quite separate tasks, 
Popularity coming from his estimates 
of the choices he will receive from 
others, Expansiveness from the num- 
ber of expressions of attraction to 
others which he makes on the socio- 
metric task. In contrast, the meas- 
ures of Popularity and Expansiveness 
from the observations of group inter- 
action and the role playing not only 
involve the same specific observers, 
but in addition the observers rated 
the pair of variables as a part of the 
same rating task in each situation. 
The apparent degree of method vari- 
ance within each of the two observa- 


tional situations, and the apparent 
sharing of method variance between 
them, is correspondingly high. 

In another paper by Borgatta 
(1955), 12 interaction process vari- 
ables were measured by quantitative 
observation under two conditions, 
and by a projective test. In this test, 
the stimuli were pictures of groups, 
for which the S generated a series of 
verbal interchanges; these were then 
scored in Interaction Process Analy- 
sis categories. For illustrative pur- 
poses, Table 8 presents the five traits 
which had the highest mean com- 
munalities in the over-all factor anal- 
ysis. Between the two highly sim- 
ilar observational methods, valida- 
tion is excellent: trait variance runs 
higher than method variance; valid- 
ity diagonals are in general higher 
than heterotrait values of both the 
heteromethod and monomethods 
blocks, most unexceptionably so for 
Gives Opinion and Gives Orientation. 
The pattern of correlation among the 
traits is also in general confirmed. 

Of greater interest because of the 
greater independence of methods are 
the blocks involving the projective 
test. Here the validity picture is 
much poorer. Gives Orientation 
comes off best, its projective test 
validity values of .35 and .33 being 
bested by only three monomethod 
values and by no hetérotrait-hetero- 
method values within the projective 
blocks. All of the other validities are 
exceeded by some hetérotrait-hetero- 
method value. 

The projective test specialist may 
object to the implicit expectations of 
a one-to-one correspondence between 
projected action and: overt action. 
Such expectations should not be at- 
tributed to Borgatta, and are not 
necessary to the metiod here pro- 
posed. For the simple symmetrical 
model of this paper, it has: been as- 





91 

















(StIT=N) 
isa | aAILOa[OUg V GNV ONIAV I AION GAAANASAG ‘YOIAVHAG AAYY AAAUASHDC NI SATAVINVA SSADOUG NOILOVAALNI 


< 
x 
pCd) er ww €2 0° 70 8%" An 2 |) Sn] Oo 2 a JUILDIIBESIP SMOYS 
S ( ) 7 67° Le" 00° ee" 6" 00° £0° we ~ or° 60° _ * *d UONLIUILO SBAIL) 
iS se x ., weer U6SlCUS . ee-nrt & A uorutdo saatcy 
a cae: or" 90° 2O'— OF i) lt 80° sO" IZ $0" *q uonsedans S9AIL) 
z c 42 a LV of" ct° ZI 80° (ae | LY" 02° ty AYJEPIOS SMOYUS 
a ysa] AANefO1g 
a 
= ’ 
Sean ( ) We- €2° Lt 6¢° 0s" Zi'— FI’ 9¢° I¢° ae | JUSIBIIDeSIP SMOYS 
re ( ) OF td 0" 80° bl tt (a A ee 7d UOIJEJUBLO SAIL) 
» ( ) OF 10° cl 8e° 09° Lo’ st° ~~ UOIUIGO S9AIZ) 
a ( . = L0° $7" 00° ce 9I° tg uoNsasans saarry 
~ . 3 6z° O° 80° «fh fh *V AJJEPIOS SMOYS 
ee SurArjq ayoy 
& 
5 ) ZO iz" lt’ $e" 'W JUIWIBIIZeSIP SMOYS 

| “ ( » 2e* 97 tl’ - iq UOIRJUDIIO SIAI‘) 
S ( ) 2 €t «*, Ido SaAid) 
= ( ) ee ig UOI|SsaBINs SaAiry 
eal ( ) ty Ayisepijos smoys 
mz 4 IABYog 991 J 
S ty tq t> tg ty | tq ty tg ty 'qW ig Iy rs ly 
= ISI] VAIQDa[oIg ButAryg oy JOLARYIY 914 
iS 
w 
S 
~ 
_ 


8 ATAVL 


r 


— 











92 D. T. CAMPBELL AND D. W. FISKE 


TABLE 9 
Mayo's INTERCORRELATIONS BETWEEN OBJECTIVE AND RATING 
MEASURES OF INTELLIGENCE AND EFFORT 
(N=166) 











Peer Ratings Objective 








Ai Bi Ag Bz 
Peer Rating 
Intelligence Ai (.85) 
Effort B, 66 (.84) 
Objective Measures 
Intelligence Ag 46 .29 3 
Effort Bz .46 .40 .10 ¢ ) 





sumed that the measures are labeled 
in correspondence with the correla- 
tions expected, 1.e., in correspondence 
with the traits that the tests are 
alleged to diagnose. Note that in 
Table 8, Gives Opinion is the best 
projective test predictor of both free 
behavior and role playing Shows Dis- 
agreement. Were a proper theoretical 
rationale available, these values 
might be regarded as validities. 
Mayo (1956) has made an analysis 
of test scores and ratings of effort and 
intelligence, to estimate the contribu- 
tion of halo (a kind of methods vari- 
ance) to ratings. As Table 9 shows, 
the validity picture is ambiguous. 
The method factor or halo effect for 
ratings is considerable although the 
correlation between the two ratings 
(.66) is well below their reliabilities 





(.84 and .85). The objective meas- 
ures share no appreciable apparatus 
overlap because they were independ- 
ent operations. In spite of Mayo’s 
argument that the ratings have some 
valid trait variance, the .46 hetero- 
trait-heteromethod value seriously de- 
preciates the otherwise impressive .46 
and .40 validity values. 

Cronbach (1949, p. 277) and Ver- 
non (1957, 1958) have both discussed 
the multitrait-multimethod matrix 
shown in Table 10, based upon data 
originally presented by H. S. Conrad. 
Using an approximative technique, 
Vernon estimates that 61% of the 
systematic variance is due to a gen- 
eral factor, that 213% is due to the 
test-form factors specific to verbal or 
to pictorial forms of items, and that 
but 113% is due to the content fac- 


TABLE 10 
MECHANICAL AND ELECTRICAL Facts MEASURED BY VERBAL AND PICTORIAL ITEMS 








Verbal Items Pictorial Items 











Ay B, Az Bs 
Verbal Items 
Mechanical Facts Ay (.89) 
Electrical Facts B, .63 C30) 
Pictorial Items 
Mechanical Facts A: 61 .45 (.82) 
Electrical Facts B, .64 (.67) 
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tors specific to electrical or to mechan- 
ical contents. Note that for the pur- 
poses of estimating validity, the in- 
terpretation of the general factor, 
which he estimates from the .49 and 
45 heterotrait-heteromethod values, 
is equivocal. It could represent de- 
sired competence variance, represent- 
ing components common to both elec- 
trical and mechanical skills—perhaps 
resulting from general industrial shop 
experience, common ability compo- 
nents, overlapping learning situations, 
and the like. On the other hand, this 
general factor could represent over- 
lapping method factors, and be due to 
the presence in both tests of multiple 
choice item format, IBM answer 
sheets, or the heterogeneity of the Ss 
in conscientiousness, test-taking mo- 
tivation, and test-taking sophistica- 
tion. Until methods that are still 


more different and traits that are 


still more independent are introduced 
into the validation matrix, this gen- 
eral factor remains uninterpretable. 


From this standpoint it can be seen 
that 213% is a very minimal estimate 
of the total test-form variance in the 
tests, as it represents only test-form 
components specific to the verbal or 
the pictorial items, i.e., test-form 
components which the two forms do 
not share. Similarly, and more hope- 
fully, the 113% content variance is a 
very minimal estimate of the total 
true trait variance of the tests, repre- 
senting only the true trait variance 
which electrical and mechanical 
knowledge do not share. 

Carroll (1952) has provided data 
on the Guilford-Martin Inventory of 
Factors STDCR and related ratings 
which can be rearranged into the 
matrix of Table 11. (Variable R has 
been inverted to reduce the number 
of negative correlations.) Two of the 
methods, Self Ratings and Inventory 
scores, can be seen as sharing method 


variance, and thus as having an in- 
flated validity diagonal. The more 
independent heteromethod blocks in- 
volving Peer Ratings show some evi- 
dence of discriminant and convergent 
validity, with validity diagonals av- 
eraging .33 (Inventory XPeer Rat- 
ings) and .39 (Self Ratings X Peer 
Ratings) against heterotrait-hetero- 
method control values averaging .14 
and .16. While not intrinsically im- 
pressive, this picture is nonetheless 
better than most of the validity ma- 
trices here assembled. Note that the 
Self Ratings show slightly higher 
validity diagonal elevations than do 
the Inventory scores, in spite of the 
much greater length and undoubtedly 
higher reliability of the latter. In ad- 
dition, a method factor seems almost 
totally lacking for the Self Ratings, 
while strongly present for the Inven- 
tory, so that the Self Ratings come 
off much the best if true trait vari- 
ance is expressed as a proportion of 
total reliable variance (as Vernon 
[1958] suggests). The method factor 
in the STDCR Inventory is undoubt- 
edly enhanced by scoring the same 
item in several scales, thus contribut- 
ing correlated error variance, which 
could be reduced without loss of reli- 
ability by the simple expedient of 
adding more equivalent items and 
scoring each item in only one scale. 
It should be noted that Carroll makes 
explicit use of the comparison of the 
validity diagonal with the hetero- 
trait-heteromethod values as a valid- 
ity indicator. 


RATINGS IN THE ASSESSMENT STUDY 
OF CLINICAL PSYCHOLOGISTS 


The illustrations of multitrait- 
multimethod matrices presented so 
far give a rather sorry picture of the 
validity of the measures of individual 
differences involved. The typical 
case shows an excessive amount of 
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method variance, which usually ex- 
ceeds the amount of trait variance. 
This picture is certainly not as a re- 
sult of a deliberate effort to select 
shockingly bad examples: these are 
ones we have encountered without at- 
tempting an exhaustive coverage of 
the literature. The several unpub- 
lished studies of which we are aware 
show the same picture. If they seem 
more disappointing than the general 
run of validity data reported in the 
journals, this impression may very 
well be because the portrait of valid- 
ity provided by isolated values 
plucked from the validity diagonal is 
deceptive, and uninterpretable in 
isolation from the total matrix. Yet 
it is clear that few of the classic ex- 
amples of successful measurement of 
individual differences are involved, 
and that in many of the instances, the 
quality of the data might have been 
such as to magnify apparatus factors, 
etc. A more nearly ideal set of per- 


sonality data upon which to illus- 


trate the method therefore 
sought in the multiple application of 
a set of rating scales in the assess- 
ment study of clinical psychologists 
(Kelly & Fiske, 1951). 

In that study, ‘‘Rating Scale A 
contained 22 traits referring to ‘‘be- 
havior which can be directly observed 
on the surface.”’ In using this scale 
the raters were instructed to “‘disre- 
gard any inferences about underlying 
dynamics or causes” (p. 207). The 
Ss, first-year clinica! psychology stu- 
dents, rated themselves and also their 
three teammates with whom they 
had participated in the various as- 
sessment procedures and with whom 
they had lived for six days. The 
median of the three teammates’ rat- 
ings was used for the Teammate 
score. The Ss were also rated on these 
22 traits by the assessment staff. Our 
analysis uses the Final Pooled rat- 


was 


ings, which were agreed upon by 
three staff members after discussion 
and review of the enormous amount 
of data and the many other ratings on 
each S. Unfortunately for our pur- 
poses, the staff members saw the rat- 
ings by Self and Teammates before 
making theirs, although presumably 
they were little influenced by these 
data because they had so much other 
evidence available to them. (See Kel- 
ly & Fiske, 1951, especially p. 64.) 
The Self and Teammate ratings rep- 
resent entirely separate ‘‘methods” 
and can be given the major emphasis 
in evaluating the data to be pre- 
sented. 

In a previous analysis of these data 
(Fiske, 1949), each of the three heter- 
otrait-monomethod triangles was 
computed and factored. To provide 
a multitrait-multimethod matrix, the 
1452 heteromethod correlations have 
been computed especially for this re- 
port.2 The full 66X66 matrix with 
its 2145 coefficients is obviously too 
large for presentation here, but will 
be used in analyses that follow. To 
provide an illustrative sample, Table 
i2 presents the interrelationships 
among five variables, selecting the 
one best representing each of the five 
recurrent factors discovered in Fiske’s 
(1949) previous analysis of the mono- 
method matrices. (These were chosen 
without regard to their validity as 
indicated in the heteromethod blocks. 
Assertive—No. 3 reflected—was se- 
lected to represent Recurrent Factor 
5 because Talkative had also a high 

2 We are indebted to E. Lowell Kelly for 
furnishing the V.A. assessment date to us, and 
to Hugh Lane for producing the matrix of 
intercorrelations. 

In the original report the correlations were 
based upon 128 men. The present analyses 
were based on only 124 of these cases because 
of clerical errors. This reduction in N leads 
to some very minor discrepancies between 
these values and those previously reported’ 
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loading on the first recurrent factor.) 

The picture presented in Table 12 
is, we believe, typical of the best 
validity in personality trait ratings 
that psychology has to offer at the 
present time. It is comforting to note 
that the picture is better than most 
of those previously examined. Note 
that the validities for Assertive ex- 
ceed heterotrait values of both the 
monomethod and heteromethod tri- 
angles. Cheerful, Broad Interests, 
and Serious have validities exceeding 
the heterotrait-heteromethod values 
with two exceptions. Only for Un- 
shakable Poise does the evidence of 
validity seem trivial. The elevation 
of the reliabilities above the hetero- 
trait-monomethod triangles is further 
evidence for discriminant validity. 

A comparison of Table 12 with the 
full matrix shows that the procedure 
of having but one variable to repre- 
sent each factor has enhanced the ap- 
pearance of validity, although not 
necessarily in a misleading fashion. 


Where several variables are all highly 


loaded on the same factor, their 
“true’’ level of intercorrelation is 
high. Under these conditions, sam- 
pling errors can depress validity diag- 
onal values and enhance others to 
produce occasional exceptions to the 
validity picture, both in the hetero- 
trait-monomethod matrix and in the 
heteromethod-heterotrait triangles. 
In this instance, with an N of 124, the 
sampling error is appreciable, and 
may thus be expected to exaggerate 
the degree of invalidity. 

Within the monomethod sections, 
errors of measurement will be cor- 
related, raising the general level of 
values found, while within the heter- 
omethods block, measurement errors 
are independent, and tend to lower 
the values both along the validity 
diagonal and in the heterotrait tri- 
angles. These effects, which may also 


be stated in terms of method factors 
or shared confounded irrelevancies, 
operate strongly in these data, as 
probably in all data involving rat- 
ings. In such cases, where several 
variables represent each factor, none 
of the variables consistently meets 
the criterion that validity values ex- 
ceed the corresponding values in the 
monomethod triangles, when the full 
matrix is examined. 

To summarize the validation pic- 
ture with respect to comparisons of 
validity values with other hetero- 
method values in each block, Table 
13 has been prepared. For each trait 
and for each of the three hetero- 
method blocks, it presents the value 
of the validity diagonal, the highest 
heterotrait value involving that trait, 
and the number out of the 42 such 
heterotrait values which exceed the 
validity diagonal in magnitude. (The 
number 42 comes from the grouping 
of the 21 other column values and the 
21 other row values for the column 
and row intersecting at the given 
diagonal value.) 

On the requirement that the valid- 
ity diagonal exceed all others in its 
heteromethod block, none of the 
traits has a completely perfect record, 
although some come close. Assertive 
has only one trivial exception in the 
Teammate-Self block. Talkative has 
almost as good a record, as does 
Imaginative. Serious has but two in- 
consequential exceptions and Interest 
in Women three. These traits stand 
out as highly valid in both self- 
description and reputation. Note 
that the actual validity coefficients of 
these four traits range from but .22 to 
.82, or, if we concentrate on the 
Teammate-Self block as most cer- 
tainly representing independent 
methods, from but .31 to .46. While 
these are the best traits, it seems that 
most of the traits have far above 
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TABLE 13 
VALIDITIES OF TRAITS IN THE ASSESSMENT STUDY OF CLINICAL PSYCHOLOGISTS, 





AS JUDGED BY THE HETEROMETHOD COMPARISONS 











Staff-Teammate Staff-Self Teammate-Self 

- High- No. ; High- = , High- No. 

al. est Highe Val. es Hiche Val. est Hicher 

ee itttiael in, Het. “ne 
1. Obstructiveness* 30 = .34 2 16. 27 9 .19 24 1 
2. Unpredictable 34 = .26 0 18 8.24 3 03 19 29 
3. Assertive* Pa .65 0 .48 45 0 46 48 1 
4. Cheerful* .53 .60 2 .42 .40 0 .24 .38 5 
5. Serious* 43 .35 0 .22 ae 2 31 .24 0 
6. Cool, Aloof .49 48 0 .2¢ .46 10 .02 .34 36 
7. Unshakable Poise .20 .40 16 aa .27 4 .14 .19 10 
8. Broad Interests* AT 46 0 .26 ae 6 35 ae 0 
9. Trustful .26 .34 5 .08 e 19 a .17 9 
10. Self-centered .30 .34 2 ei ay + | 6 —.07 .19 36 
11. Talkative* .82 65 0 AT 45 0 43 48 1 
12. Adventurous 45 .60 6 .28 .30 2 .16 .36 14 
13. Socially Awkward 45.37 0 06.21 28 04 = 16 30 
14. Adaptable* 44 40 0 ae sae 10 Pe .29 8 
15. Self-sufficient* 32 .33 1 13 18 5 .18 mb 0 
16. Worrying, Anxious* 41 37 0 23 33 5 “35 .16 1 
17. Conscientious .26 33 4 11 san 19 21 .23 2 
18. Imaginative* 43 46 1 a2 31 0 36 8.32 0 
19. Interest in Women* 42 43 2 55 .38 0 Be 40 1 
20. Secretive, Reserved* 40 .58 5 .38 40 2 32 35 3 
21. Independent Minded 39 ~=—.42 2 .08 = =.25 19 .21 .30 3 
22. Emotional Expression* 62 .63 1 31 46 5 5 ee 10 





Note.—Val. =value in validity diagonal; Highest Het. =highest heterotrait value; No. Higher =number of 


heterotrait values exceeding the validity diagonal. 


Trait names which have validities in all three heteromethod blocks significantly greater than the heterotrait- 


heteromethod values at the .001 level. 


chance validity. All those having 10 
or fewer exceptions have a degree of 
validity significant at the .001 level 
as crudely estimated by a one-tailed 
sign test.* All but one of the variables 
meet this level for the Staff-Team- 
mate block, all but four for the Staff- 


3 If we take the validity value as fixed (ig- 
noring its sampling fluctuations), then we can 
determine whether the number of values 
larger than it in its row and column is less than 
expected on the null hypothesis that half the 
values would be above it. This procedure re- 
quires the assumption that the position (above 
or below the validity value) of any one of 
these comparison values is independent of the 
position of each of the others, a dubious as- 
sumption when common methods and trait 
variance are present. 


Self block, all but five for the most 
independent block, Teammate-Self. 
The exceptions to significant validity 
are not parallel from column to col- 
umn, however, and only 13 of 22 
variables have .001 significant valid- 
ity in all three blocks. These are indi- 
cated by an asterisk in Table 13. 
This highly significant general 
level of validity must not obscure the 
meaningful problem created by the 
occasional exceptions, even for the 
best variables. The excellent traits 
of Assertive and Talkative provide 
a case in point. In terms of Fiske’s 
original analysis, both have high 
loadings on the recurrent factor 
“Confident self-expression’ (repre- 
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sented by Assertive in Table 12). 
Talkative also had high loadings on 
the recurrent factor of Social Adapta- 
bility (represented by Cheerful in 
Table 12). We would expect, there- 
fore, both high correlation between 
them and significant discrimination 
as well. And even at the common 
sense level, most psychologists would 
expect fellow psychologists to dis- 
criminate validly between assertive- 
ness (nonsubmissiveness) and talka- 
tiveness. Yet in the Teammate-Self 
block, Assertive rated by self cor- 
relates .48 with Talkative by team- 
mates, higher than either of their 
validities in this block, .43 and .46. 

In terms of the average values of 
the validities and the frequency of 
exceptions, there is a distinct trend 
for the Staff-Teammate block to 
show the greatest agreement. This 
can be attributed to several factors. 
Both represent ratings from the ex- 
ternal point of view. Both are aver- 
aged over three judges, minimizing 
individual biases and undoubtedly in- 
creasing reliabilities. Moreover, the 
Teammate ratings were available to 
the Staff in making their ratings. An- 
other effect contributing to the less 
adequate convergence and discrim- 
ination of Self ratings was a response 
set toward the favorable pole which 
greatly reduced the range of these 
measures (Fiske, 1949, p. 342). In- 
spection of the details of the instances 
of invalidity summarized in Table 13 
shows that in most instances the ef- 
fect is attributable to the high spec- 
ificity and low communality for the 
self-rating trait. In these instances, 
the column and row intersecting at 
the low validity diagonal are asym- 
metrical as far as general level of cor- 
relation is concerned, a fact covered 
over by the condensation provided 
in Table 13. 


The personality psychologist is 


initially predisposed to reinterpret 
self-ratings, to treat them as symp- 
toms rather than to interpret them 
literally. Thus, we were alert to in- 
stances in which the self ratings were 
not literally interpretable, yet none- 
theless had a diagnostic significance 
when properly ‘‘translated.”” By and 
large, the instances of invalidity of 
self-descriptions found in this assess- 
ment study are not of this type, but 
rather are to be explained in terms of 
an absence of communality for one 
of the variables involved. In general, 
where these self descriptions are in- 
terpretable at all, they are as literally 
interpretable as are teammate de- 
scriptions. Such a finding may, of 
course, reflect a substantial degree of 
insight on the part of these Ss. 

The general success in discriminant 
validation coupled with the parallel 
factor patterns found in Fiske’s 
earlier analysis of the three intra- 
method matrices seemed to justify an 
inspection of the factor pattern valid- 
ity in this instance. One possible pro- 
cedure would be to do a single analy- 
the whole 66X66 matrix. 
Other approaches focused upon sep- 
arate factoring of heteromethods 
blocks, matrix by matrix, could also 
be suggested. Not only would such 
methods be extremely tedious, but in 
addition they would leave undeter- 
mined the precise comparison of 
factor-pattern similarity. Correlat- 
ing factor loadings over the popula- 
tion of variables was employed for 
this purpose by Fiske (1949) but 
while this provided for the identifica- 
tion of recurrent factors, no single 
over-all index of factor pattern sim- 
ilarity was generated. Since our im- 
mediate interest was in confirming a 
pattern of interrelationships, rather 
than in describing it, an efficient 
short cut was available: namely to 
test the similarity of the sets of heter- 


sis of 
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otrait values by correlation coeffi- 
cients in which each entry repre- 
sented the size values of the given 
heterotrait coefficients in two differ- 
ent matrices. For the full matrix, 
such correlations would be based 
upon the N of the 22X21/2 or 231 
specific heterotrait combinations. 
Correlations were computed between 
the Teammate and Self monometh- 
ods matrices, selected as maximally 
independent. (The values to follow 
were computed from the original cor- 
relation matrix and are somewhat 
higher than that which would be ob- 
tained from a reflected matrix.) The 
similarity between the two mono- 
methods matrices was .84, corrob- 
orating the factor-pattern similarity 
between these matrices described 
more fully by Fiske in his parallel 
factor analyses of them. To carry 
this mode of analysis into the hetero- 
method block, this block was treated 
as though divided into two by the 
validity diagonal, the above diagonal 
values and the below diagonal repre- 
senting the maximally independent 
validation of the heterotrait correla- 
tion pattern. These two correlated 
.63, a value which, while lower, shows 
an impressive degree of confirmation. 
There remains the question as to 
whether this pattern upon which the 
two heteromethod-heterotrait _ tri- 
angles agree is the sarne one found in 
common between the two mono- 
method triangles. The intra-Team- 
mate matrix correlated with the two 
heteromethod triangles .71 and .71. 
The intra-Self matrix correlated with 
the two .57 and .63. In general, then, 
there is evidence for validity of the 
intertrait relationship pattern. 


DISCUSSION 


Relation to construct validity. While 
the validational criteria presented are 
explicit or implicit in the discussions 
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of construct validity (Cronbach & 
Meehl, 1955; APA, 1954), this pa- 
per is primarily concerned with the 
adequacy of tests as measures of a 
construct rather than with the ade- 
quacy of a construct as determined 
by the confirmation of theoretically 
predicted associations with measures 
of other constructs. We believe that 
before one can test the relationships 
between a specific trait and other 
traits, one must have some confidence 
in one’s measures of that trait. Such 
confidence can be supported by evi- 
dence of convergent and discriminant 
validation. Stated in different words, 
any conceptual formulation of trait 
will usually include implicitly the 
proposition that this trait is a re- 
sponse tendency which can be ob- 
served under more than one experi- 
mental condition and that this trait 
can be meaningfully differentiated 
from other traits. The testing of 
these two propositions must be prior 
to the testing of other propositions to 
prevent the acceptance of erroneous 
conclusions. For exemple, a con- 
ceptual framework might postulate a 
large correlation between Traits A 
and B and no correlation between 
Traits A and C. If the experimenter 
then measures A and B by one 
method (e.g., questionnaire) and C 
by another method (such as the meas- 
urement of overt behavior in a situa- 
tion test), his findings may be con- 
sistent with his hypotheses solely as 
a function of method variance com- 
mon to his measures of A and B but 
not to C. 

The requirements of this paper are 
intended to be as appropriate to the 
relatively atheoretical efforts typical 
of the tests and measurements field 
as to more theoretical efforts. This 
emphasis on validational criteria ap- 
propriate to our present atheoretical 
level of test construction is not at all 














VALIDATION BY THE MULTITRAIT-MULTIMETHOD MATRIX 


incompatible with a recognition of 
the desirability of increasing the ex- 
tent to which all aspects of a test and 
the testing situation are determined 
by explicit theoretical considerations, 
as Jessor and Hammond have advo- 
cated (Jessor & Hammond, 1957). 

Relation to operationalism. Under- 
wood (1957, p. 54) in his effective 
presentation of the operationalist 
point of view shows a realistic aware- 
ness of the amorphous type of theory 
with which most psychologists work. 
He contrasts a psychologist’s ‘‘lit- 
erary’’ conception with the latter’s 
operational definition as represented 
by his test or other measuring instru- 
ment. He recognizes the importance 
of the literary definition in communi- 
cating and generating science. He 
cautions that the operational defini- 
tion ‘“‘may not at all measure the 
process he wishes to measure; it may 
measure something quite different” 
(1957, p. 55). He does not, however, 
indicate how one would know when 
one was thus mistaken. 

The requirements of the present 
paper may be seen as an extension of 
the kind of operationalism Under- 
wood has expressed. The test con- 
structor is asked to generate from his 
literary conception or private con- 
struct not one operational embodi- 
ment, but two or more, each as dif- 
ferent in research vehicle as possible. 
Furthermore, he is asked to make ex- 
plicit the distinction between his new 
variable and other variables, distinc- 
tions which are almost certainly im- 
plied in his literary definition. In his 
very first validational efforts, before 
he ever rushes into print, he is asked 
to apply the several methods and sev- 
eral traits jointly. His literary defini- 
tion, his conception, is now best rep- 
resented in what his independent 
measures of the trait hold distinc- 
tively in common. The multitrait- 
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multimethod matrix is, we believe, 
an important practical first step in 
avoiding ‘‘the danger... that the 
investigator will fall into the trap of 
thinking that because he went from 
an artistic or literary conception 
. . . to the construction of items for a 
scale to measure it, he has validated 
his artistic conception”’ (Underwood, 
1957, p. 55). In contrast with the 
single operationalism now dominant 
in psychology, we are advocating a 
multiple operationalism, a convergent 
operationalism (Garner, 1954; Garner, 
Hake, & Eriksen, 1956), a methodologi- 
cal triangulation (Campbell: 1953, 
1956), an operational delineation 
(Campbell, 1954), a convergent valida- 
tion. 

Underwood's presentation and that 
of this paper as a whole imply moving 
from concept to operation, a sequence 
that is frequent in science, and per- 
haps typical. The same point can be 
made, however, in inspecting a tran- 
sition from operation to construct. 
For any body of data taken from a 
single operation, there is a subinfinity 
of interpretations possible; a sub- 
infinity of concepts, or combinations 
of concepts, that it could represent. 
Any single operation, as representa- 
tive of concepts, is equivocal. In an 
analogous fashion, when we view the 
Ames distorted room from a fixed 
point and through a single eye, the 
data of the retinal pattern are equiv- 
ocal, in that a subinfinity of hexa- 
hedrons could generate the same pat- 
tern. The addition of a second view- 
point, as through binocular parallax, 
greatly reduces this equivocality, 
greatly limits the constructs that 
could jointly account for both sets of 
data. In Garner’s (1954) study, the 
fractionation measures from a single 
method were equivocal—they could 
have been a function of the stimulus 
distance being fractionated, or they 
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could have been a function of the 
comparison stimuli used in the judg- 
ment process. A multiple, convergent 
operationalism reduced this equivo- 
cality, showing the latter conceptual- 
ization to be the appropriate one, and 
revealing a preponderance of meth- 
ods variance. Similarly for learning 
studies: in identifying constructs 
with the response data from animals 
in a specific operational setup there is 
equivocality which can operationally 
be reduced by introducing transposi- 
tion tests, different operations so de- 
signed as to put to comparison the 
rival conceptualizations (Campbell, 
1954). 

Garner’s convergent operational- 
ism and our insistence on more than 
one method for measuring each con- 
cept depart from Bridgman’s early 
position that “if we have more than 
one set of operations, we have more 
than one concept, and strictly there 
should be a separate name to cor- 
respond to each different set of op- 
erations” (Bridgman, 1927, p. 10). 
At the current stage of psychological 
progress, the crucial requirement is 
the demonstration of some conver- 
gence, not complete congruence, be- 
tween two distinct sets of operations. 
With only one method, one has no 
way of distinguishing trait variance 
from unwanted method variance. 
When psychological measurement 
and conceptualization become better 
developed, it may well be appropri- 
ate to differentiate conceptually be- 
tween Trait-Method Unit A; and 
Trait-Method Unit A», in which 
Trait A is measured by different 
methods. More likely, what we have 
called method variance will be speci- 
fied theoretically in terms of a set of 
constructs. (This has in effect been 
illustrated in the discussion above in 
which it was noted that the response 
set variance might be viewed as 
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trait variance, and in the rearrange- 
ment of the social intelligence ma- 
trices of Tables 4 and 5.) It will then 
be recognized that measurement pro- 
cedures usually involve several the- 
oretical constructs in joint applica- 
tion. Using obtained measurements 
to estimate values for a single con- 
struct under this condition still re- 
quires comparison of complex meas- 
ures varying in their trait composi- 
tion, in something like a multitrait- 
multimethod matrix. Mill’s joint 
method of similarities and differences 
still epitomizes much about the ef- 
fective experimental clarification of 
concepts. 

The evaluation of a multitrait-multi- 
method matrix. The evaluation of the 
correlation matrix formed by inter- 
correlating several trait-method units 
must take into consideration the 
many factors which are known to 
affect the magnitude of correlations. 
A value in the validity diagonal must 
be assessed in the light of the reliabil- 
ities of the two measures involved: 
e.g., a low reliability for Test Ag 
might exaggerate the apparent 
method variance in Test A;. Again, 
the whole approach assumes ade- 
quate sampling of individuals: the 
curtailment of the sample with re- 
spect to one or more traits will de- 
press the reliability coefficients and 
intercorrelations involving these 
traits. While restrictions of range 
over all traits produces serious diffi- 
culties in the interpretation of a mul- 
titrait-multimethod matrix and 
should be avoided whenever possible, 
the presence of different degrees of 
restriction on different traits is the 
more serious hazard to meaningful 
interpretation. 

Various _ statistical treatments 
for multitrait-multimethod matrices 
might be developed. We have con- 
sidered rough tests for the elevation 
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of a value in the validity diagonal 
above the comparison values in its 
row and column. Correlations be- 
tween the columns for variables 
measuring the same trait, variance 
analyses, and factor analyses have 
been proposed to us. However, the 
development of such statistical meth- 
ods is beyond the scope of this paper. 
We believe that such summary sta- 
tistics are neither necessary nor ap- 
propriate at this time. Psychologists 
today should be concerned not with 
evaluating tests as if the tests were 
fixed and definitive, but rather with 
developing better tests. We believe 
that a careful examination of a multi- 
trait-multimethod matrix will indi- 
cate to the experimenter what his 
next steps should be: it will indicate 
which methods should be discarded 
or replaced, which concepts need 
sharper delineation, and which con- 
cepts are poorly measured because of 
excessive or confounding method var- 
iance. Validity judgments based on 
such a matrix must take into account 
the stage of development of the con- 
structs, the postulated relationships 
among them, the level of technical 
refinement of the methods, the rela- 
tive independence of the methods, 
and any pertinent characteristics of 
the sample of Ss. We are proposing 
that the validational process be 
viewed as an aspect of an ongoing 


program for improving measuring 
procedures and that the “validity 
coefficients” obtained at any one 


stage in the process be interpreted in 
terms of gains over preceding stages 
and as indicators of where further ef- 
fort is needed. 

The design of a multitrait-multi- 
method matrix. The several methods 
and traits included in a validational 
matrix should be selected with care. 
The several methods used to measure 
each trait should be appropriate to 
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the trait as conceptualized. Although 
this view will reduce the range of 
suitable methods, it will rarely re- 
strict the measurement to one opera- 
tional procedure. 

Wherever possible, the several 
methods in one matrix should be com- 
pletely independent of each other: 
there should be no prior reason for 
believing that they share method 
variance. This requirement is neces- 
sary to permit the values in the heter- 
omethod-heterotrait triangles to ap- 
proach zero. If the nature of the 
traits rules out such independence 
of methods, efforts should be made to 
obtain as much diversity as possible 
in terms of data-sources and classifi- 
cation processes. Thus, the classes of 
stimuli or the background situations, 
the experimental contexts, should be 
different. Again, the persons provid- 
ing the observations should have dif- 
ferent roles or the procedures for 
scoring should be varied. 

Plans for a validational matrix 
should take into account the differ- 
ence between the interpretations re- 
garding convergence and discrimina- 
tion. It is sufficient to demonstrate 
convergence between two clearly dis- 
tinct methods which show little over- 
lap in the heterotrait-heteromethod 
triangles. While agreement between 
several methods is desirable, conver- 
gence between two isa satisfactory 
minimal requirement. Discrimina- 
tive validation is not so easily 
achieved. Just as it is impossible to 
prove the null hypothesis, or that 
some object does not exist, so one 
can never establish that a trait, as 
measured, is differentiated from all 
other traits. One can only show that 
this measure of Trait A has little 
overlap with those measures of B and 
C, and no dependable generalization 
beyond B and C can be made. For 
example, social poise could probably 
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be readily discriminated from aes- 
thetic interests, but it should also be 
differentiated from leadership. 

Insofar as the traits are related and 
are expected to correlate with each 
other, the monomethod correlations 
will be substantial and heteromethod 
correlations between traits will also 
be positive. For ease of interpreta- 
tion, it may be best to include in the 
matrix at least two traits, and prefer- 
ably two sets of traits, which are 
postulated to be independent of each 
other. 

In closing, a word of caution is 
needed. Many multitrait-multi- 
method matrices will show no con- 
vergent validation: no relationship 
may be found between two methods 
of measuring a trait. In this common 
situation, the experimenter should 
examine the evidence in favor of sev- 
eral alternative propositions: (a) 
Neither method is adequate for meas- 
uring the trait; (6) One of the two 
methods does not really measure the 
trait. (When the evidence indicates 
that a method does not measure the 
postulated trait, it may prove to 
measure some other trait. High cor- 
relations in the heterotrait-hetero- 
method triangles may provide hints 
to such possibilities.) (c) The trait is 
not a functional unity, the response 
tendencies involved being specific to 


D. T. CAMPBELL AND D. W. FISKE 


the nontrait attributes of each test. 
The failure to demonstrate conver- 
gence may lead to conceptual devel- 
opments rather than to the abandon- 
ment of a test. 


SUMMARY 


This paper advocates a_ valida- 
tional process utilizing a matrix of 
intercorrelations among tests repre- 
senting at least two traits, each meas- 
ured by at least two methods. Meas- 
ures of the same trait should correlate 
higher with each other than they do 
with measures of different traits in- 
volving separate methods. Ideally, 
these validity values should also be 
higher than the correlations among 
different traits measured by the same 
method. 

Illustrations from the literature 
show that these desirable conditions, 
as a set, are rarely met. Method or 
apparatus factors make very large 
contributions to psychological meas- 
urements. 

The notions of convergence be- 
tween independent measures of the 
same trait and discrimination be- 
tween measures of different traits are 
compared with previously published 
formulations, such as construct valid- 
ity and convergent operationalism. 
Problems in the application of this 
validational process are considered. 


REFERENCES 


AMERICAN PSYCHOLOGICAL ASSOCIATION. 
Technical recommendations for psychologi- 
cal tests and diagnostic techniques. Psy- 
chol. Bull., Suppl., 1954, 51, Part 2, 1-38. 

ANDERSON, E. E. Interrelationship of drives 
in the male albino rat. I. Intercorrelations 
of measures of drives. J. comp. Psychol., 
1937, 24, 73-118. 

Ayer, A. J. The problem of knowledge. 
York: St Martin's Press, 1956. 

Boreoatrta, E. F. Analysis of social interaction 
and sociometric perception. 
1954, 17, 7-32. 

Borcatta, E. F. Analysis of social interac- 


New 


Sociometry, 


tion: Actual, role-playing, and projective. 
J. abnorm. soc. Psychol., 1955, 51, 394-405. 

BripGMAN, P. W. The logic of modern physics. 
New York: Macmillan, 1927. 

Burwen, L. S., & CAMPBELL, D. T. The gen- 
erality of attitudes toward authority and 
nonauthority figures. J. abnorm. soc. Psy- 
chol., 1957, 54, 24-31. 

CAMPBELL, D. T. A study of leadership among 
submarine officers. Colurabus: Ohio State 
Univer. Res. Found., 1953. 

CAMPBELL, D. T. Operational delineation of 
“what is learned” via the transposition ex- 
periment. Psychol. Rev., 1954, 61, 167-174. 








VALIDATION BY THE MULTITRAIT-MULTIMETHOD MATRIX 


CampBELL, D. T. Leadership and its effects 
upon the group. Monogr. No. 83. Colum- 
bus: Ohio State Univer. Bur. Business Res., 
1956. 

CARROLL, J. B. Ratings on traits measured by 
a factored personality inventory. J. ab- 
norm. soc. Psychol., 1952, 47, 626-632. 

Cu, P.-L. Statistical analysis of personality 
rating. J. exp. Educ., 1937, 5, 229-245. 

CRONBACH, L. J. Response sets and test valid- 
ity. Educ. psychol. Measmt, 1946, 6, 475- 
494, 

Cronpacu, L. J. Essentials of psychological 
testing. New York: Harper, 1949. 

CronpacuH, L. J. Further evidence on re- 
sponse sets and test design. Educ. psychol. 
Measmt, 1950, 10, 3-31. 

CRONBACH, L. J., & MEEHL, P. E. Construct 
validity in psychological tests. Psychol. 
Bull., 1955, 52, 281-302. 

Epwarps, A. L. The social desirability variable 
in personality assessment and research. New 
York: Dryden, 1957. 

FeicLt, H. The mental and the physical. In 
H. Feigl, M. Scriven, & G. Maxwell (Eds.), 
Minnesota studies in the philosophy of sci- 
ence. Vol. Il. Concepts, theories and the 
mind-body problem. Minneapolis: Univer. 
Minnesota Press, 1958. 

FiskE, D. W. Consistency of the factorial 


structures of personality ratings from differ- 
ent sources. J. abnorm. soc. Psychol., 1949, 
44, 329-344. 

GARNER, W. R. Context effects and the valid- 
ity of loudness scales. J. exp. Psychol., 1954, 
48, 218-224. 

GARNER, W. R., HAkKe, H. W., & ERIKSEN, 


C. W. Operationism and the concept of 
perception. Psychol. Rev., 1956, 63, 149-159. 
Jessor, R., & HamMonp, K. R. Construct 
validity and the Taylor Anxiety Scale. 
Psychol. Bull., 1957, 54, 161-170. 
KELLEY, T. L., & Krey, A. C. Tests and meas- 


105 

urements in the social sciences. New York: 
Scribner, 1934. 

Ke Lty, E. L., & Fiske, D. W. The prediction 
of performance in clinical psychology. Ann 
Arbor: Univer. Michigan Press, 1951. 

LoEVINGER, J., GLESER, G. C., & DuBots, 
P. H., Maximizing the discriminating power 
of a multiple-score test. Psychometrika, 
1953, 18, 309-317. 

Lorce, I. Gen-like: Halo or reality? Psychol. 
Bull., 1937, 34, 545-546. 

Mayo, G. D. Peer ratings and halo. 
psychol. Measmt, 1956, 16, 317-323. 

STRANG, R. Relation of social intelligence to 
certain other factors. Sch. & Soc., 1930, 32, 
268-272. 

Symonps, P. M. Diagnosing personality and 
conduct. New York: Appleton-Century, 
1931. 

THORNDIKE, E. L. A constant error in psy- 
chological ratings. J. appl. Psychol., 1920, 
4, 25-29. 

THORNDIKE, R. L. Factor analysis of social 
and abstract intelligence. J. educ. Psychol., 
1936, 27, 231-233. 

TuurstToneE, L. L. The reliability and validity 
of tests. Ann Arbor: Edwards, 1937. 

Tryon, R. C. Individual differences. In F. A. 
Moss (Ed.), Comparative Psychology. (2nd 
ed.) New York: Prentice-Hall, 1942. Pp. 
330-365. 

Unperwoop, B. J. Psychological research. 
New York: Appleton-Century-Crofts, 1957. 

VERNON, P. E. Educational ability and psy- 
chological factors. Address given to the 
Joint Education-Psychology Colloquium, 
Univer. of Illinois, March 29, 1957. 

Vernon, P. E. Educational testing and test- 
form factors. Princeton: Educational Test- 
ing Service, 1958. (Res. Bull. RB-58-3.) 


Received June 19, 1958. 


Educ. 





PSYCHOLOGICAL BULLETIN 
VoL. 56, No. 2, 1959 


THE IMPORTANCE OF TIME AS STIMULUS OF 
CONDITIONED REFLEX ACTIVITY! 


A. S. DMITRIEV anp A. M. KOCHIGINA 
Taroslavl’ 


Translated from Uspekhi Sovremennoi Biologii 


Investigations extending over 
many years on the conditioned reflex 
activity of the higher animals and 
man have shown that “innumerable 
variations in both the external and 
internal media of the organism, each 
of which is reflected in certain states 
of the cortical nerve cells, may them- 
selves become individual conditioned 
stimuli” (Pavlov, 1947, p. 51). 

It is quite natural that time, as one 
of the fundamental attributes of the 
existence of matter, and being closely 
linked with all variations in the ex- 
ternal and internal media of the or- 
ganism, should be very frequently 
found to be one of the components of 
the conditioned stimulus, emerging 
in the role of an exciter of conditioned 
reflex activity sui generis. The con- 
ditioned stimulus is not, of course, 
time merely as time, but a change 
predestined to occur at a definite 
time in the state of the cortical cells, 
a state induced by the action of the 
external or internal medium of the or- 
ganism. In consequence of the link- 
ing of these changes with time, the 
conditions are created for the reflec- 
tion of time as an objective reality in 
the cerebral cortex and for the forma- 
tion of conditioned reflexes to certain 
segments of time, as was demon- 
strated experimentally by the investi- 


1 This article was first published in Uspekhi 
sovremennot biologit, 1955, 40, 31-51. It was 
translated by the Pergamon Institute for the 
Russian Scientific Translation Program of the 
National Institutes of Health, U. S. Public 
Health Service. Reprints of this article may 
be obtained by writing directly to the Editor 
of the Psychological Bulletin. 
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gations of I. P. Pavlov and his co- 
workers. 

The formation of conditioned re- 
fléxes to time plays an important part 
in the systemic activity of the cere- 
bral cortex, in the development of 
definite periodicity in physiological 
functions, and in the establishment of 
rhythmical pattern reactions in the 
working activity of man. By virtue 
thereof the question of the cortical 
mechanisms involved in the process 
of formation of conditioned reflexes 
to time acquires considerable theoret- 
ical and practical importance. The 
present work is concerned with the 
analysis and generalization of find- 
ings given in the literature and the re- 
sults of our own investigations. 


| 


The fact that conditioned reflexes 
are formed to time was first estab- 
lished in Pavlov’s laboratory, in the 
experiments of G. P. Zelenyi (1907), 
and later, those of K. N. Krzhish- 
kovskii (1908), on the establishment 
of conditioned salivary reflexes in 
dogs, when the conditioned stimuli 
were administered at equal intervals 
of time. 

G. P. Zelenyi, working on the for- 
mation of conditioned salivary re- 
flexes to various sound stimuli, ad- 
ministered a combination of sound 
stimulus and feeding every 10 min- 
utes. After a number of repetitions, 
it was noted that conditioned reflex 
salivation occurred regularly at the 
end of the 10-minute interval, even 
in the absence of conditioned stimula- 
tion. 





TIME AS STIMULUS OF CONDITIONED REFLEX ACTIVITY 


In the work of K. N. Krzhishkov- 
skii on an investigation of condi- 
tioned inhibition it was established 
that the application of inhibition con- 
stantly at the nineteenth to twentieth 
minute led to the following events: 
the conditioned stimulus always 
proved inactive at the nineteenth to 
twentieth minute, but manifested its 
effect at the thirty-third to thirty- 
fourth minute. Krzhishkovskii ob- 
served this phenomenon over a period 
of six months. Wishing to determine 
whether time was the factor deter- 
mining inhibition under these circum- 
stances, he set up experiments for the 
establishment of a conditioned sali- 
vary reflex when a combination of the 
unconditioned stimulation (solution 
of hydrochloric acid) with the con- 
ditioned stimulus was administered 
constantly every 10-13 minutes. It 
was found that “if the animal's oral 
cavity was stimulated at equal inter- 
vals of time, then,’’"—in the author's 
words—‘‘with the passage of time 
such a state developed that at the ap- 
propriate moment, even in the ab- 
sence of any obvious stimulations, a 
flow of saliva began, and frequently 
also there was a characteristic motor 
reaction.” 

The importance of time as a factor 
in the establishment of conditioned 
reflexes has also been noted by other 
investigators in Pavlov’s laboratory 
in works referable to the same period. 
Thus, observations by I. P. Pimenov 
(1907), I. V. Zavadskii (1908), F. S. 
Grossman (1909), and V. M. Dobro- 
vol’skii (1911) showed that, in the 
case of trace and delayed reflexes, re- 
action to the conditioned stimulation 
supervened only after a certain time 
from the commencement of its action, 
that is, after the lapse of the segment 
of time which habitually elapsed be- 
fore reinforcement was administered. 
In other words, the conditioned reflex 
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was formed not simply to a particular 
stimulus, but to stimulus plus a cer- 
tain segment of time. 

These various investigations justi- 
fied consideration of the question of 
time as a stimulus of signal signifi- 
cance, and of the power of the cortical 
cells to reckon time. And even at 
that time Pavlov, on the basis of 
existing factual material, advanced a 
hypothesis on the mechanism of time 
reckoning in the central nervous 
system. 

“It may be thought,” he wrote, 
“that intensity analysis, in part at 
least, is a basic element in the meas- 
urement of time by animals. One can 
speculate whether any external agent 
of uniform, constant strength acts on 
the particular analyser of the animal, 
and whether there is gradual fading 
in the nerve cells of the residuum, 
the trace of actual stimulation which 
has ceased; each intensity of the stim- 
ulated state of the cell, at each sepa- 
rate moment of time, is an individual 
element, differing from both all pre- 
ceding and all following stages of in- 
tensity. Time is perhaps measured by 
these elements as units, and every 
moment of time recorded in the 
nervous system” (1947, pp. 115-116). 

Investigations dealing specially 
with the role of time in the formation 
of conditioned reflexes were first car- 
ried out by Iu. P. Feokritova (1912) 
at Pavlov’s suggestion. ‘‘We pro- 
posed to determine,’’ she wrote, 
‘“‘whether time... could be linked, 
like all phenomena in the external en- 
vironment, with the activity of the 
salivary gland, and whether it could 
serve as a specific salivary stimulus. 
If this proved to be the case, we 
wished to determine how rapidly the 
link between a definite time and the 
work of the salivary gland could be 
established, and thereafter to study 
to the fullest possible extent the 
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characteristic features of the new re- 
flex to time” (1912, p. 17). 

With these ends in view, lu. P. 
Feokritova established conditioned 
salivary reflexes in dogs, repeating 
the combination of conditioned stim- 
ulus (sound of the metronome) and 
unconditioned (feeding of meat- 
biscuit powder or the introduction of 
hydrochloric acid into the mouth) at 
predetermined intervals of time (in 
the case of one dog, every 30 minutes; 
for another every 15 minutes; and in 
the case of-a third, every 10 minutes). 
In a number of the experiments only 
one reinforcement was administered 
—at the same intervals of time. 

A stable reflex to time was formed 
after 200—230 repetitions (the number 
varied for the different animals), and 
was evidenced by the fact that after 
a certain interval of time, and just 
before the time for the next reinforce- 
ment, saliva secretion began. The 
author also noted that, before the 
formation of the stable conditioned 
reflex to time, there was irregular sal- 
ivation in the intervals between rein- 
forcements. Later, salivation began 
to be limited in time to the second 
half of the interval, and ultimately 
approximated more and more closely 
to the moment of reinforcement. 
Salivation in the interval between 
reinforcements was, however, not in- 
frequent (intersignal reaction) even 
after the formation of a stable condi- 
tioned reflex to time. 

According to Feokritova’s observa- 
tions, the conditioned reflex to “‘pure 
time” was formed more rapidly than 
when time was combined with an 
auditory stimulus. She explains this 
on the grounds that time, as an inde- 
pendent stimulus, is very weak, and 
that the auditory stimulus overshad- 
ows its action. 

Examining the effect of the metro- 
nome in combination with different 
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intervals of time (except that to 
which the conditioned reflex was 
formed), Feokritova found that the 
effect of the metronome, when not 
associated with a definite interval of 
time, was negligible. Thus, in the dog 
in which a conditioned reflex to a 30- 
minute interval was established the 
operation of the metronome at the 
fourteenth minute induced secretion 
of only one drop of saliva, whereas at 
the thirtieth minute the action of the 
same metronome was accompanied 
by the secretion of 6-11 drops. The 
author concludes from this that “‘in 
our summated reflex, time is the more 
active excitant and consequently also 
the main component, and not the 
metronome” (1912, p. 57). 

By her specially designed experi- 
ments Feokritova established that 
the differentiation of time can be 
brought to such a degree of accuracy 
that, for example, when‘a conditioned 
reflex has been established with a 30- 
minute interval, salivation is timed 
to occur exactly at the thirtieth min- 
ute, and is completely absent a min- 
ute earlier. 

In her next series of experiments 
Feokritova tested the action of vari- 
ous extraneous stimuli (whistle, ven- 
tilator, gramophone) on: the course of 
the conditioned reflex to time. Ap- 
plied in the middle of the interval be- 
tween reinforcements, ‘such stimuli 
did not, as a rule, occasion any 
changes whatever in the course of the 
reflex. When they were used simul- 
taneously with the metronome, then 
the conditioned reflex ‘to time was 
inhibited (although not in all dogs, 
and not in the same degree). Applied 
a minute before the metronome, and 
ending 20-30 seconds before it began 
to act, they sometimes inhibited and 
sometimes intensified the conditioned 
reflex to time. As the reflex became 
stable, the effect of extraneous stim- 
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uli became attenuated, or even disap- 
peared. When powerful extraneous 
stimuli were allowed to act and when 
any sudden change was introduced 
into the course or setting of the exper- 
iments, time differentiation was com- 
pletely destroyed. 

In this connection it is fitting to re- 
call the observations of the author 
on the effect of a somnolent state, 
which the animals frequently de- 
veloped in the course of the experi- 
ments, on the course of the condi- 
tioned reflex. Feokritova noted that 
a somnolent state not only failed to 
weaken the conditioned reflex to 
time, but even increased the differ- 
entiating power of the animal. 

Feokritova’s observations showed 
that extinction of the conditioned re- 
flex to time occurred suddenly. Not- 
ing the similarity in this respect to 
the extinction of trace reflexes, the 
author assigns the conditioned reflex 
to time to the group of trace reflexes. 

Feokritova concluded from her in- 
vestigations that time can be an ex- 
citant of conditioned reflex activity 
in the salivary gland, and that the 
formation of a conditioned reflex to 
time was subject to the same laws as 
the formation of conditioned reflexes 
to other stimuli. 

In offering an explanation of the 
mechanism for the reckoning of time 
by the nervous system, Feokritova 
wrote: ‘‘It is known that, after every 
stimulation, there remains in the 
cerebral cortex the trace of a whole 
series of states of stimulation of the 
nerve cell, gradually diminishing in 
intensity. At every individual mo- 
ment, in each short interval of time, 
the intensity of the nerve cell stimula- 
tion will be different from what it is 
at any other moment. If we take the 
intensity of nerve cell stimulation 
during each short interval of time as 
an independent unit of stimulation, 
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it can be stated that the exciter of the 
salivary centre in the case of a time 
reflex is that degree of intensity that 
always coincides in time with the act 
of eating’’ (1912, p. 162-163). 

The work of M. M. Stukova (1914) 
was a direct continuation of Feokri- 
tova’s investigations, and she con- 
firmed the fact that a conditioned re- 
flex could be established to time (a 
20-minute interval in combination 
with mechanical skin stimulation), 
the time reflex being formed very 
rapidly in her experiments (after 65 
repetitions). In dogs which had been 
transferred to her from Feokritova, 
Stukova noted the complete reten- 
tion of the conditioned reflexes and 
time differentiation established ear- 
lier. 

In her own experiments Stukova 
tested the effect of quite a range of 
different factors on the course of the 


conditioned reflex to time. Above all, 
she tested the effect of faradic cur- 
rents of different strengths. Her ob- 
servations demonstrated that the ef- 
fect depended on the current strength 
and the characteristics of the ani- 


mals’ nervous systems. In the dog 
with well-developed inhibition proc- 
esses currents of various strengths 
had no appreciable effect on the con- 
ditioned reflex, and only at the com- 
mencement of current action during 
the first application was some very 
transient disturbance of «ime differ- 
entiation observed. In excitable dogs 
inhibition of the conditioned reflex to 
time and disturbance of the correct 
reckoning of time were observed as a 
result of the action of faradic current, 
particularly if strong. A powerful 
current, acting momentarily, inhib- 
ited the reflex to time in all animals. 
Thus, like Feokritova, Stukova noted 
the inhibitory effect of change in the 
setting and course of the experiment 
on the conditioned reflex. 
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In her next series of experiments 
Stukova examined the effect of caf- 
feine (0.05 gm.) and of cocaine (0.02- 
0.03 gm.) on conditioned reflexes to 
time. After subcutaneous injection 
of caffeine, all the animals showed 
(after 8-11 minutes) increased ex- 
citability, disturbance in the reckon- 
ing of time, and disinhibition of time 
differentiation. Under the action of 
cocaine, in addition to increased ex- 
citability, the appearance of continu- 
ous salivation in the intervals be- 
tween reinforcements was observed. 
The reckoning of time by the nervous 
system was upset, and differentiation 
was disordered. 

Stukova established in special ex- 
periments that the conditioned reflex 
to time appears (although varying in 
the degree of its expression) in re- 
sponse to any of the components in 
the summated stimulus (e.g., to 
sight or smell of food only, without 
the reinforcement, and also to the 
action of the metronome alone or of 
the mechanical skin stimulation 
alone). 

“The reckoning of time by the ani- 
mals,’’ Stukova concluded, ‘‘is possi- 
ble, not only from a compound stimu- 
lation, that is, from the sum of many 
stimuli, as in our case (natural ali- 
mentary and artificial conditioned re- 
flexes), but also from any one of 
these, taken separately”’ (p. 136). 

The investigation of conditioned 
time reflexes was continued by V. S. 
Deriabin (1916) on the same animals. 
When he started work with these ani- 
mals, he first of all noted that in all 
the dogs the conditioned reflexes to 
time produced earlier were restored 
at the first attempt and immediately 
attained their former magnitude. 

Deriabin then tested the effect of 
a three-month interruption in the 
work. He demonstrated in this way 
that, with the lapse of three months, 
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all the dogs manifested disorder of 
time reckoning. 

Change in the experimental setting 
(particularly isolation cf the animal 
from the experimenter) had the same 
effect on the reflex. In this case the 
time reflex and time differentiations 
disappeared, to reappear only after 
prolonged repetition. 

Deriabin made a detailed investi- 
gation of the effects of sodium bro- 
mide and chloral hydrate on the re- 
flex to time; these substances were in- 
jected into the animal’s rectum, dis- 
solved in distilled water. Sodium 
bromide, 2 gm., had no effect on the 
magnitude of the conditioned reflex 
to time: ‘‘The intensification of the 
processes of internal inhibition,” 
wrote the author in this connection, 
“which is usually seen with sodium 
bromide in this quantity, did not lead 
to increase in the power of the dog’s 
central nervous system to discrim- 
inate time. Time differentiation re- 
mained unchanged” (p. 94). Chloral 
hydrate, 0.5 gm. to 2.0 gm., had no 
appreciable effect on the conditioned 
reflex to time, but in doses of 4-5 gm. 
it had a suppressing action on the re- 
flex, although the degree of suppres- 
sion differed in different animals. 

This author’s findings relative to 
changes in the conditioned reflex to 
time after destruction of individual 
parts of the cerebral cortex are of con- 
siderable interest. In one dog in 
which a conditioned reflex to the ac- 
tion of a ‘“‘kololka’’ (mechanical skin 
stimulator) in combination with a 20- 
minute interval of time had been 
established the cerebral region of the 
cutaneous analyser on one side was 
destroyed. In another dog in which a 
conditioned reflex to the sound of the 
metronome in combination with a 
time interval of 15 minutes had been 
produced, the cerebral region of the 
auditory analyser was removed on 
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both sides. In both cases the author 
noted temporary disappearance after 
the operation of the conditioned re- 
flex to the corresponding stimulus 
(‘‘kololka,’’ metronome) in combina- 
tion with time. With the lapse of a 
certain period, however, the condi- 
tioned reflex to time was restored 
completely: the conditioned reflex to 
‘pure time’’ was quite clearly ex- 
pressed in one dog seven days, and in 
the other two days after the opera- 
tion. 

In his own work Deriabin then ex- 
amined the question of whether the 
nervous system of the dog is capable 
of discriminating between the time 
properties of sound and its other 
properties. For this purpose he set 
up experiments in which conditioned 
reflexes were established to various 
sounds (tuning-fork, gurgling sound, 
whistle) with rhythmical intervals 
(two seconds’ sound—two seconds’ 
interval). Differentiation of the 


basic rhythm from rhythms of the 


same sound, but with longer or 
shorter sounds and intervals, was 
then achieved. These experiments 
showed that rhythm differentiation 
of this type, established to one sound 
stimulus (whistling), appeared im- 
mediately when other sound stimuli 
(buzzer, tuning-fork) were admin- 
istered. 

The author therefore concluded 
that ‘‘the dog is capable of distin- 
guishing the time properties of an 
auditory stimulus from other prop- 
erties of sound, and is guided by them 
in its physiological activity,’’ and 
that the nervous system of the dog 
“is capable of correlating similar 
time properties of several sound stim- 
ulations, as something common to 
the various types of sounds”’ (p. 141). 

We see, therefore, that the works of 
Iu. P. Feokritova, M. M. Stukova, 
and V. S. Deriabin have provided a 
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simple experimental basis for Pav- 
lov’s hypothesis on time as a true 
excitant of conditioned reflex activ- 
ity in animals. 

“Tt can be stated,’’ wrote Pavlov 
in connection with Feokritova’s ex- 
periments, ‘“‘that in this particular 
case time was the conditioned stim- 
ulus” (1947, p. 50). This stimulus 
“is in no way less real,’’ he pointed 
out, ‘‘than all the preceding stimuli” 
(1947, p. 49). 

‘Physiologically, how are we to 
understand time as a conditioned 
stimulus?’’ asked Pavlov. In answer- 
ing this question, he started from the 
fact that in ordinary life we note time 
with the aid of certain cyclical phe- 
nomena, the setting and rising of the 
sun, the movements of hands on the 
dials of clocks, etc. And in our own 
bodies there are not a few of these 
cyclical phenomena. In the course of 
a day the brain receives stimula- 
tions, becomes exhausted and then is 
restored again. The digestive canal 
is periodically filled with food, and 
periodically emptied. And in that 
every state of an organ can be re- 
flected in the cerebral hemispheres, 
we have there a basis for distinguish- 
ing one moment of time from another. 
Let us take short intervals of time. 
When stimulation has just been ad- 
ministered it is felt very acutely. 
When we enter a room in which there 
is some odor or other, we at first feel 
it very strongly, but afterwards less 
and less. Under the influence of 
stimulation the state of the nerve cell 
undergoes a series of changes. It is 
exactly the same in the converse 
case. When stimulation ceases, this 
is felt very acutely at first, and then 
less and less, until finally we are un- 
aware of it. This again means that 
there are a number of different 
states of the nerve cell. Cases of re- 
flexes to interruption of the stimulus 
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and trace reflexes, as well as cases of 
reflexes to time can be explained 
from the same point of view. In the 
experiment referred to (an experi- 
ment of Feokritova) the animal was 
fed periodically, and in association 
therewith a number of organs mani- 
fested a certain activity, that is, they 
underwent certain successive changes. 
All this was registered in the cerebral 
hemispheres, was received by them, 
and the definite moment of these 
changes became a conditioned stim- 
ulus” (1947, p. 50-51). 

After the work of Feokritova, 
Stukova, and Deriabin, salivary con- 
ditioned reflexes to time were ex- 
amined by a number of authors. 
These investigations, as well as giv- 
ing greater definition to the condi- 
tions attending the formation of re- 
flexes to time, demonstrated their 
importance in the systemic activity 
of the cerebral cortex. 

Thus, F. D. Vasilenko (1932), 
working with a pattern of condi- 
tioned food stimuli, repeated at iden- 
tical intervals of time, noted increase 
from experiment to experiment in the 
conditioned reflex reaction to one of 
the weak stimuli in the pattern 
(light). The author established that 
this increase in the secretory effect to 
light was the result of the summation 
of the conditioned reflex to light with 
the reflex to time which was being 
developed. 

F. P. Maiorov (1933) also de- 
scribes the formation of a conditioned 
reflex to time in relation to the action 
of a pattern. The author, having 
established a conditioned salivary re- 
flex to the sound of the metronome 
(60 beats per minute) and a motor 
conditioned reflex to a different 


metronome frequency (120 beats per 
minute), began to alternate the ac- 
tion of these stimuli at equal inter- 
vals of time (six minutes). Compara- 
tively soon (after nine repetitions) 
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the author was able to establish a 
conditioned reflex to time, in which 
at first, in complete independence of 
the nature of the conditioned stimu- 
lation, both salivation and motor re- 
action appeared towards the end of 
the six-minute interval. Later, how- 
ever, the conditioned reflex to time 
became differentiated; towards the 
end of the sixth minute, before the 
operation of the metronome (120 per 
minute), the motor reaction ap- 
peared, and towards the end of the 
sixth minute before the operation of 
the metronome (60 beats per min- 
ute), salivation began. 

E. G. Vatsuro (1948), employing a 
pattern of stimuli, established in 
dogs, over a period of several months 
without interruption, a conditioned 
reflex to time, which manifested it- 
self in the secretion of saliva 15-30 
seconds before the beginning of the 
action of the conditioned stimulus, 
but subsequently the moment of the 
commencement of salivation coin- 
cided more and more closely with the 
moment of action of the conditioned 
stimulus. On other days feeding of 
the animal was carried out at the 
same intervals of time as,in the pre- 
ceding experiments, but without ad- 
ministration of the conditioned stim- 
uli. It was found that in this case the 
reflex to time reproduced the picture 
of the conditioned pattern of reac- 
tion exactly. ‘‘The exactiness of the 
reproduction,” wrote the author, 
“‘reached the point of coniplete iden- 
tity of the ratio of the values of the in- 
dividual reflexes in the system with 
the corresponding reflexes to time”’ 
(1948, p. 13). 

The importance of the time factor 
in the systemic activity cf the cere- 
bral cortex has been noted in their 
respective works by P. &. Kupalov 
(1929, 1931, 1933), E. A. Asratian 
(1938) and others. ; 

The features in the formation of 
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conditioned reflexes to time, orig- 
inally discovered in experiments on 
dogs by. the salivary secretion 
method, have been given greater defi- 
nition by investigations on the motor 
conditioned reflexes to time in these 
animals. 

One of the first descriptions of the 
motor-defence conditioned reflex to 
time in dogs is found in the work of 
I. S. Beritov (1932). In experiments 
on dogs he repeated electrical stimu- 
lation, causing a defensive reflex 
flexion of a front limb, every five 
minutes and found that a conditioned 
reflex to time was formed after 40 
repetitions. This reflex manifested 
itself in the fact that a minute before 
the next stimulation was due, the 
animal (which up to this point had 
been in a somnolent state) woke up, 
shook its head, and raised the limb. 

In the literature we meet with very 
diverse findings on the question of 
the rate of, and conditions for, the 
formation of conditioned motor re- 
flexes to time in dogs. Thus, in the 
work of L. S. Gambarian (1952) there 
is evidence of the very rapid forma- 
tion of the motor-defence conditioned 
reflex to time in the case of a three- 
minute intersignal interval (unfor- 
tunately, the author, whose investi- 
gation had different aims, did not 
dwell in detail on his description of 
the process of formation of the condi- 
tioned reflex to time). On the other 
hand, O. P. Bolotina (1952a) states 
that with a three-minute interval the 
conditioned reflex to time is formed 
much more slowly, and sometimes 
fails completely to develop. Thus, 
having established a conditioned re- 
flex to pressure with a forepaw on the 
pedal of a special apparatus in dogs, 
and repeating it with food reinforce- 
ment at 10-minute intervals, Bolotina 
noted the formation of a conditioned 
reflex to time only after 18 experi- 
ments (180 combinations). The at- 
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tempt to transfer the conditioned re- 
flex for the 10-minute intervals to 
shorter intervals (1-3 minutes) was 
successful only in rare cases. The 
author noted the formation of an un- 
stable conditioned reflex to a two- 
minute interval after 52 experiments 
(520 combinations) in only one of 
three dogs. In the other dogs this 
was not achieved; a stable condi- 
tioned reflex could not be formed 
even to a three-minute interval, de- 
spite a large number of repetitions 
(344). In these experiments Bolotina 
noted only the appearance of a large 
number of intersignal reactions which, 
with repetition of the experiments, 
did not diminish in number, and did 
not become concentrated in the sec- 
ond half of the intersignal interval, as 
occurred in the course of the forma- 
tion of the conditioned reflex to the 
10-minute interval. The difficulty, 


and in a number of cases the impossi- 
bility, of forming conditioned reflexes 


to short intervals of time is due, in the 
opinion of the author, to the fact that 
in the case of a very short interval of 
time the traces of stimulation fail, 
against a background of alimentary 
excitation, to become concentrated, 
but, being summated, radiate through 
the cortex, and manifest themselves 
as a mass of intersignal pressures 
(1952a, p. 32). 

According to the observations of 
Bolotina (1952a), a motor condi- 
tioned reflex to time is formed more 
readily with a compound stimulus 
than to “pure’’ time. Extinction of 
the conditioned reflex to time was 
reached after three—five applications 
without reinforcement. Examining 
the effect of extraneous stimuli on 
the conditioned reflex to time, the 
author found that the operation of 
extraneous stimuli immediately be- 
fore the reinforcement or in the first 
seconds of its application caused in- 
hibition of the conditioned reflex to 
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time in a number of cases, but their 
operation anywhere in the middle 
part of the intersignal interval usu- 
ally failed to cause any appreciable 
changes in the conditioned reflex to 
time. Bolotina’s observations indi- 
cate that changes in the experimental! 
setting or interruptions in the work 
produced very marked disturbances 
in the conditioned reflex to time. Spe- 
cial experiments made by this author 
showed that hunger disturbed the 
animal’s conditioned reflex to time, 
the manifestations being the appear- 
ance of intersignal motor reactions. 
In another series of experiments 
Bolotina (1953) investigated the ef- 
fect of bromide and caffeine on motor 
conditioned reflexes to time in dogs. 
This work showed that bromide in 
doses of 0.5-1.0 gm. hastened the 
formation of the conditioned reflex 
to time, and at the same time created 
conditions for the formation of a con- 
ditioned reflex to shorter intervals of 
time (two minutes), to which previ- 
ously a conditioned reflex to time 
could not be established. When the 
conditioned time reflex was dis- 
turbed (as, for example, in attempts 
to transfer it to shorter intervals of 
time), the use of bromide in the same 
dose promoted its restoration and 
stabilization. In doses of 2-3 gm., 
bromide intensified the conditioned 
reflex to time. The use of bromide in 
combination with caffeine had a more 
pronounced positive effect on condi- 
tioned reflexes to time in dogs. 
Motor-defence conditioned reflexes 
to time have been studied in detail 
by A. M. Kochigina in our labora- 
tory. Her experiments showed that a 
motor-defence conditioned reflex to 
“‘pure’’ time (electrical stimulation 
repeated every five minutes) was 
formed very slowly, passing through 
certain stages in the course of its de- 
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velopment. The process of condi- 
tioned time reflex formation began 
with the appearance of .intersignal 
motor reactions, the number of which 
increased every time, reaching 50-60 
in each five-minute interval. Within 
the limits of the five-minute interval 
the intersignal reactions were irregu- 
larly distributed, as the dotted line 
curve in Fig. 1 shows. This curve has 
a markedly undulant character, which 
means that the groups of intersignal 
reactions alternated with pauses of 
longer or shorter duration, in which 
the intersignal reactions were absent 
or scanty. This initial stage in the 
formation of the conditioned time re- 
flex we have called the stage of gen- 
eralized conditioned reflex to time. 
In the next stage the number of 
intersignal reactions begins to dimin- 
ish, and they become less and less 
frequent in the first half of the five- 
minute interval until they are no 
longer observed in this period, being 
concentrated in the second half of the 
interval, and increasing in number as 
the moment for the next reinforce- 
ment approaches. The ciange in the 
number of intersignal reactions oc- 
curring in the five-minute interval 
during this stage is shown by the con- 
tinuous line curve in Fig. 1. We see 
that it is shorter by half than the 
dotted line, being displaced into the 
second half of the interval, and shows 
a notable rise towards the moment of 
the next reinforcement. We have 
termed this stage in the formation of 
the conditioned reflex to time the 
stage of formation of the differenti- 
ated conditioned reflex to time. 
Subsequently, the number of inter- 
signal reactions continues to dimin- 
ish until they disappear completely. 
Only in the few seconds preceding the 
moment of the next reinforcement is 
there the occasional appearance of a 
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Fic. 1. DETAILS OF THE APPEARANCE OF INTERSIGNAL REACTIONS AT VARIOUS STAGES IN 


THE FORMATION OF A Motor-DEFENSE CONDITIONED REFLEX TO TIME (DoG, RozKa). 

Time is plotted on the abscissa from the moment cf application of the conditioned stimulus, 
and on the ordinate, the number of intersignal reactions in every 30-second segment of the 
five-minute intersignal interval. The dotted line shows the distribution of intersignal reactions 
in one of the five-minute intervals in experiment No. 4 of 13 October 1953, and the continuous 
line, those in a five-minute interval from experiment No. 27 of 14 November 1953. 


motor reaction. Thus, a more or less 
differentiated conditioned reflex to 
time has been formed. It was formed 
after 90-135 repetitions. The ac- 
curacy of the time reckoning after 
the formation of the differentiated 
conditioned reflex to time (expressed 
as the ratio of the time to the appear- 
ance of motor reactions to the total 
duration of the intersignal interval) 
ranged from 80 to 93 per cent in our 
dogs. 

The duration of the individual 
stages and their relationship to one 
another in the process of the forma- 
tion of a conditioned reflex to time 
varied in the different animals, as 
will be readily seen from Fig. 2. The 
differences in the duration of the sec- 


ond stage were particularly notice- 
able, and a certain parallelism was 
observed between the duration of 
this stage and the total time required 
for the formation of the differentiated 
conditioned reflex to time. 

The motor-defence conditioned re- 
flex to time was formed in the dogs 
more rapidly to the compound stim- 
ulus (sound plus the five-minute in- 
terval) than to “‘pure’”’ time (after 64 
and 97 repetitions respectively). 
During the process of the formation 
of the conditioned time reflex to the 
compound stimulus exactly the same 
stages were observed, but the number 
of intersignal reactions under these 
experimental conditions was consid- 
erably !ess. The accuracy of time 
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DURATION OF THE STAGES IN THE PROCESS OF FORMATION OF A Motor-DEFENCE 


The columns with lines in one diagonal show the duration (in number of repetitions) of the 
initial generalized stage of the conditioned reflex to time, the hatched columns that of the stage 
of the differentiated time reflex, and the white columns the total time required ! or the formation 


of the differentiated reflex to time. 


reckoning after the formation of the 
differentiated conditioned reflex to 
the compound stimulus was the same 
as for ‘‘pure’’ time. 

In the dogs the conditioned reflex 
to time was extinguished very rapidly 
(after 4-5 unreinforced applications). 
After a 10-day interruption, the con- 
ditioned time reflex was restored 
after 10-15 repetitions, the picture of 
restoration of the conditioned reflex 
recalling that of its formation, but 
with a more rapid sequence of stages. 

We were able to confirm the find- 
ings of our predecessors relative to 
the effect of extraneous stimuli, 
bromide and caffeine, on the condi- 
tioned reflex to time. According to 


our observations, powerful extrane- 
ous stimuli caused temporary loss of 
time differentiation, and led to the 
appearance of a large number of in- 
tersignal reactions. Bromide, two 
grams, reduced the number of inter- 
signal reactions, and accelerated the 
process of the formation of a condi- 
tioned reflex to time. Caffeine in 
doses of 0.5-1.0 gm. led to the ap- 
pearance of a large number of inter- 
signal reactions, and disturbed the 
accuracy of time reckoning. 

We were unable to establish a 


motor-defence reflex to time to inter- 
vals of less than one minute, despite 
a large number of repetitions (300- 
500). 


With 30-second intervals be- 
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tween stimulations we observed a 
large number of intersignal reactions 
in all the animals, increasing with 
each successive experiment. Parallel 
with this, the animals became in- 
creasingly restless with each succeed- 
ing experiment, whimpering and 
barking and trying to bite their way 
out of their harness, until finally 
further attempts had to be aban- 
doned. 

In addition to salivary and motor 
conditioned reflexes to time, the con- 
ditioned reflex changes in metabo- 
lism, induced in dogs through definite 
time intervals, were also investigated. 
The work of K. M. Bykov and his 
colleagues on this subject demon- 
strated that a conditioned reflex to 
time could also be formed on the 
basis of metabolic changes. Thus, 
G. V. Nesterovskii and A. D. Slonim 
(1936) investigated thermal polyp- 
noea in dogs (that is, accelerated 
respiration developed in response toa 
rise in the environmental tempera- 
ture), having established a condi- 
tioned reflex (to the sound of the 
metronome) on the basis of this un- 
conditioned reflex reaction. The 
authors then used the metronome 
every five minutes, and observed reg- 
ularly recurring accesses of polypnoea 
at five-minute intervals, even in the 
absence of metronome action. R. P. 
Ol’nianskaia and A. D. Slonim (1938) 
observed the formation of condi- 
tioned reflexes to time in metabolic 
investigations on dogs. The animals 
were kept for five hours in a cold 
room (temperature: 10° C.) or in a 
warm room (temperature: 22° C.) 
corresponding changes in metabolism 
naturally resulted—and were then re- 
turned to the kennels in which the 
temperature was below that of the 
warm, but higher than that of the 
cold room. Repeating these experi- 
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ments systematically, the authors 
noted that toward the end of the five- 
hour stay the metabolism of dogs in 
the warm room began to increase, 
and that of dogs in the cold room, to 
fall. 

Considering this to be the result of 
the formation of time associations, 
K. M. Bykov remarked that ‘‘these 
associations were established to a 
definite interval of time, that preced- 
ing the transfer of the dog to the ken- 
nels in which the animals’ metab- 
olism adjusted itself to the tempera- 
ture of the surroundings” (1947, p. 
137). This level of metabolism in the 
kennels was higher than in the room 
when heating was used in the experi- 
ments, but was lower than in the 
room in the cold experiments. Sim- 
ilar results were obtained in these ex- 
periments when the animals were 
kept in a room with a neutral tem- 
perature (15-16° C.) and were then 
transferred to a warm (22° C.) ora 
cold (10° C.) room. Here, the metab- 
olism of the dog about to be trans- 
ferred to a warm room fell, and that 
of the dog transferred to the cold 
room rose. 

We have seen, then, that condi- 
tioned reflexes to time can be formed 
in dogs on the basis of quite a variety 
of unconditioned reactions. The in- 
vestigations of conditioned time reac- 
tions in dogs have shown that these 
conditioned reflexes are formed with 
much greater difficulty than condi- 
tioned reflexes to other stimuli. They 
are less stable, are easily inhibited by 
the action of extraneous stimuli or by 
change in the experimental setting, 
and are rapidly extinguished in the 
absence of reinforcement. In the 
mechanism of their formation, condi- 
tioned time reflexes resemble trace 
reflexes. They are formed to trace 


stimulations in a variety of analysers. 
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These traces, recurring at equal in- 
tervals of time, form a single complex 
stimulus which, in combination with 
the basic reaction, becomes the ex- 
citer of this reaction. Like all condi- 
tioned reflexes, the conditioned reflex 
to time in dogs has a generalized 
character in the initial stage of its 
formation, and only later does it be- 
come more or less differentiated. 


II 


Conditioned reflexes to time, so 
thoroughly studied in dogs, have also 
been examined in other animals, and 
also in man. 

P. M. Nikiforovskii (1929) investi- 
gated conditioned reflexes to time in 
tortoises. In one tortoise he estab- 
lished a conditioned motor reflex to 
the smell of carnation oil, with the 
conditioned stimulus acting every 
three minutes. A conditioned reflex 
to “pure’’ time was established in 
another tortoise, the animal being 
compelled to retract its head every 
four minutes. The author noted the 
development of a conditioned reflex 
to time in both tortoises. The first, 
after several experiments, began to 
draw in its head exactly half a minute 
before the exhibition of the carnation 
oil smell, and the second, on each oc- 
casion a short time before the expira- 
tion of the four-minute period, re- 
tracted its head, irrespective of what 
it was doing at the moment (in move- 
ment or at rest). The author re- 
marked that conditioned reflexes to 
time in tortoises are distinguished by 
their extreme instability; they are 
extinguished by the omission of only 
one reinforcement, but their re-estab- 
lishment requires repeated reinforce- 
ment. He also drew attention to the 
ease with which conditioned time re- 
flexes in tortoises are inhibited by ex- 
traneous stimuli (e.g., by the sound of 
a passer’s-by footsteps). 
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B. I. Baiandurov (1937) made a de- 
tailed study of conditioned reflexes to 
time in birds (pigeons). His experi- 
ments were carried out ‘in the follow- 
ing manner. The electrodes of an in- 
duction apparatus, in: the primary 
circuit of which there was a timing 
device closing the circuit after a defi- 
nite interval, were brought into 
contact with the foot of a pigeon, 
placed in a special chamber. Move- 
ment of the foot in response to stimu- 
lation by the current was recorded by 
means of a lever on the smoked drum 
of a kymograph. Describing the re- 
sults of his investigatiors, the author 
noted particularly that it was impos- 
sible to produce a conditioned reflex 
to short intervals of time (5-15 min- 
utes) in birds, despite a large num- 
ber of repetitions. Even the forma- 
tion of a conditioned reflex to an in- 
terval of 30 minutes took place quite 
slowly (after 300 repetitions). The 
conditioned time reflex in birds was 
unstable and was rapidly extin- 
guished (after two-three omissions of 
reinforcement). The conditioned re- 
flex to time, when formed, was not al- 
ways retained until the following day. 
Alcohol abolished the conditioned 
reflex to time. In the course of the 
formation of a conditioned reflex to 
time in birds, Baiandurov noted the 
appearance of involuntary move- 
ments in the intervals between the 
electrical stimulations (on the 25th 
day from the commencement of the 
experiment, after 150 repetitions), 
and these movements increased in 
number with each successive day, be- 
coming more and more concentrated 
every time towards the time of circuit 
closure. 

The formation of conditioned re- 
flexes to time in various animals has 
been noted in a number of works by 
K. M. Bykov and his colleagues in 
connection with the analysis of dif- 
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ferent types of periodic changes in 
physiological functions. Thus, L. G. 
Filatova (1949), examining the daily 
periodicity in the long-eared hedge- 
hog, found that in these animals 
the greatest muscular activity oc- 
curred during the night-time (1600— 
0200 hours), and activity fell to 
a minimum in the daylight hours. 
Coincident with this, there was an 
increase in the secretion of gastric 
juice in the evening and night hours. 
“This whole picture,’’ wrote the 
author, ‘‘can be regarded as the for- 
mation of a conditioned association 
to time, elaborated in the process of 
phylogenesis in connection with the 
constant hunting for prey in these 
hours” (1949, p. 129). 

By giving food to the hedgehogs in 
several experiments during daytime, 
Filatov formed new conditioned re- 
flexes to time, as a result of which the 
animals manifested two periods of 
motor and secretory activity—one by 
day and one by night. From his ob- 
servations the author concluded that 
conditioned linkage to time is a 
fundamental element in the daily pe- 
riodicity of rest and activity in this 
species of animal. The formation of 
conditioned reactions to time in long- 
eared hedgehogs was confirmed by 
the experiments of L. A. Isaakian 
(1953a), who investigated the condi- 
tioned reflex thermoregulation mech- 
anism in these animals. When the 
action of a certain temperature had 
been combined a number of times 
with the particular setting and time 
of the experiment, the author found 
that for several days after change of 
temperature the former level of me- 
tabolism was maintained. For ex- 
ample, when the animals were kept 
for 20 days in a chamber with the 
temperature at 22—25° C., the body 
temperature and oxygen consump- 
tion became established at a definite 
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level. When the animals were trans- 
ferred to a room with a temperature 
of 10-12° C., no noticeable change in 
their metabolic rate was observed 
during the first two-three days, an 
event which the author correctly re- 
garded as the result of the formation 
of a conditioned reaction to the ex- 
perimental setting and time of the 
previous experiments, and, in the ab- 
sence of reinforcement, the extinc- 
tion of the time association so formed 
took place, according to his observa- 
tions, on the third to fifth day. 

Natural conditioned reflexes to 
time, associated in their origin with 
the times of taking food, have been 
described by L. A. Isaakian (1953b) 
from his investigations on rabbits. 
His observations showed that in rab- 
bits the temperature of the ears rises 
regularly during the hours of feeding. 
For example, the ears of rabbits 
which were fed at 1500 hours gen- 
erally showed a raised temperature 
from 1200 to 1800 hours, after which 
the temperature fell abruptly. The 
same daily variations in ear tempera- 
ture were seen in hungry animals, 
that is, in rabbits which had not been 
fed at the established hour. When 
the daily regime was changed (trans- 
fer of the feeding time from 1500 to 
2000 hours), a new rhythm of ear 
temperature variation was formed, 
the maximum rise of temperature 
falling within the period 2000 to 0200 
hours. 

The importance of the develop- 
ment of conditioned reflexes to time 
in the creation of a daily periodicity 
in basal metabolism was noted also 
by A. D. Slonim (1945) and by A. G. 
Ponugaeva (1949) in experiments on 
bats. 

K. B. 


Svechin (1952), studying 
changes in the daily rhythm of physi- 
ological functions in cattle, found 
that, in young bullocks transferred 
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from the conditions of maintenance 
in stalls to field life, a certain type of 
daily fluctuation in pulse rate and 
temperature developed, and that this 
was maintained on days when the ex- 
ternal conditions (variations in the 
outside temperature, in relative at- 
mospheric humidity, etc. were con- 
siderably changed from those prevail- 
ing on the preceding days. In ex- 
actly the same way also, animals 
transferred from field conditions to 
stall conditions retained for several 
days the periodicity in physiological 
functions established while they were 
kept in the open. The author ex- 
plains the results of his investigations 
as indicating that a definite rhythm 
of changes in the environmental con- 
ditions creates a conditioned reflex 
to time in the animals. 

The formation of conditioned re- 
flexes to time under various condi- 
tions in monkeys is described in a 
number of works by K. M. Bykov 
and his colleagues. 

The formation of the conditioned 
reflex to time in monkeys (Abyssin- 
ian baboon, M. rhesus, M. lapunder) 
was observed in an unusual form by 
O. P. Shcherbakova (1937, 1938, 
1949). By artificially reversing the 
alternation of light and darkness, the 
author was able to reverse the ani- 
mal’s daily periodicity. By creating 
two changes of light and darkness in 
the 24 hours, Shcherbakova estab- 
lished a regime of two ‘‘nights’’ and 
two ‘‘days”’ in the course of 24 hours. 
It is a curious fact that, on transition 
to the previous normal light regime, 
the periodic changes in functions cor- 
responding to the prolonged, artifi- 
cially created periods of light and 
darkness, were maintained for some 
time. ‘‘It seemed to me completely 
logical to assume,’ wrote K. M. 
Bykov, in connection with these ob- 
servations, ‘‘that the regular occur- 
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rence of alternation of day and night 
(irrespective of whether they are 
natural, astronomical, or created 
artificially for the monkeys by alter- 
nation of the periods of lighting) 
likewise creates a conditioned reflex 
to time in the cerebral! cortex’’ (1947, 
p. 153). 

Later K. P. Ivanov, A. R. Maka- 
rova, and A. A. Fufacheva (1953) con- 
tinued these investigations of Shcher- 
bakova on the effect of periodic al- 
ternation of light and darkness on 
the periodicity of physiological func- 
tions in monkeys (macaques, ba- 
boons, long-tailed monkeys). Not- 
ing that the level of gaseous exchange 
was lowest at the time twilight ap- 
proached, these authors regarded 
twilight illumination as the signal 
stimulus, being a component of the 
daytime lighting and occurring also 
at the boundary between the two 
astronomical periods, day and night, 
which determine different conditions 
in the organism—the transition of 
the animal from daytime activity to 
nighttime rest. This signal stimulus 
enters into the natural conditioned 
reflex pattern. The authors note that 
when artificial twilight was created 
30 minutes before the advent of nat- 
ural twilight, the fall in metabolism 
at the usual time of twilight was 
maintained unchanged for the first 
two days, and only on the third day 
did a complete change in metabolism, 
corresponding to the new light condi- 
tions develop. ‘‘Thus,’’ concluded 
the authors, ‘“‘twilight not only causes 
a fall in metabolism and is of active 
inhibitory character, but is also 


closely associated with a conditioned 
time reflex, involving the entire daily 
periodicity in the physiological func- 
tions of the organism”’ (p. 101). 
Analysing the daily periodic fluctu- 
ations in the blood sugar content of 
monkeys (M. rhesus, Abyssinian ba- 
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boon), E.S. Kanfor (1949) found that 
an increase in blood sugar occurred 
in relation to the feeding times, and 
regarded this as the result of the de- 
velopment of a conditioned reflex to 
time. 

The formation of a conditioned re- 
flex to the feeding time in monkeys 
was also noted by L. G. Voronin 
(1948). He observed the formation 
of a conditioned reflex to the time of 
feeding, manifested by the appear- 
ance of sucking movements and gen- 
eral motor unrest towards the times 
of feeding in young rhesus monkeys 
at about the third week. 

L. G. Voronin (1951) described the 
formation of conditioned linkages to 
time in monkeys (baboons, ma- 
caques, long-tailed monkeys) in ex- 
periments in which a system of stim- 
uli repeated at regular intervals was 
used. Thus, when the conditioned 
foodseeking reflex was repeated every 
two minutes, it was very soon noted 
that toward the end of the second 
minute after the administration of 
the stimulus the monkey ran to the 
feeding box and tried to reach the 
food. After establishment of the 
conditioned reflex to the stereotype 
of stimuli, operating at minute inter- 
vals, it could often be seen that, even 
in the experiment without use of the 
conditioned stimuli (‘‘sham”’ experi- 
ment), the animal ran to the food box 
every minute and pressed the lever. 
At first the conditioned reaction to 
time was not absolutely accurate 
(the animal applied pressure 10-15 
seconds before the end of the minute 
interval), but with continued feeding 
of the animal every minute, the reflex 
to time became progressively more 
and more exact. 

According to the findings of Voro- 
nin and his colleagues, the condi- 
tioned reflex to ‘‘pure’’ time (in which 
only feeding of the animal was car- 
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ried out at regular intervals) was 
more difficult to establish in mon- 
For example, in rhesus mon- 
keys establishment of a conditioned 
reflex to ‘‘pure’’ time (1 minute) re- 
quired about 400 feeding reinforce- 
ments. In Voronin’s opinion, the con- 
ditioned reflex to time in these cases 
is formed much more quickly, but is 
masked by the conditioned reflex to 
the setting. 

When the intersignal intervals are 
changed from one to two or three 
minutes, the conditioned reflex to the 
former time is preserved in the first 
two or three experiments, but fairly 
soon thereafter (after 6-10 experi- 
ments) a conditioned reflex to the 
new time is established. In monkeys 
it is much more difficult to establish 
conditioned reflexes when the inter- 
signal intervals are prolonged to 
three-five minutes. When the inter- 
val between stimuli was considerably 
lengthened, a large number of inter- 
signal reactions became evident, and 
these Voronin regarded as the mani- 
festation of the earlier established 
conditioned reflexes to time and as 
a result of disinhibition of the condi- 
tioned reflex to the setting. The op- 
timum interval between stimuli for 
the formation of a conditioned reflex 
to time in monkeys is thought by the 
author to be one-two minutes. 

Voronin regarded the suppression 
of the conditioned reflex to the stimu- 
lus preceding the differential signal in 
the stereotype of stimuli, which was 
frequently observed in monkeys, as a 
special case of a conditioned reflex to 
time (‘‘retrograde”’ inhibition of E. A. 
Asratian, 1934). In the experiments 
of Voronin and his co-workers, the 
system of stimuli was so arranged 
that every third or fourth (and some- 
times sixth) stimulus was a differenti- 
ation stimulus. Consequently the 
second, third, or fifth stimulus was, 


keys. 
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in Voronin’s opinion, not only the 
signal of a positive conditioned reflex, 
but was also a signal for the following 
differentiation, so that sometimes 
these signals acquired also a negative 
signal significance. ‘‘We suggest,” 
wrote the author, ‘‘that we are deal- 
ing here with an instance of that 
power of the nervous system to 
‘reckon time,’’ which shows itself in 
the form of a conditioned reflex 
(1951, p. 221). 

O. P. Bolotina (one of Voronin’s 
co-workers) gives a detailed descrip- 
tion in her work (1952b) of the for- 
mation of conditioned reflexes to 
time in monkeys. According to her 
observations, such a conditioned re- 
flex is readily formed, and begins to 
appear even on the first experimental 
day, but the time required for its 
stabilization increases as the inter- 
signal interval is lengthened. The 
conditioned reflex forms more slowly 
to ‘‘pure”’ time than to a compound 
stimulus (time plus auditory or 
photic stimulation). In the latter 
case Voronin employed a pattern of 
sound and light stimuli, following 
each other at regular intervals, while 
reinforced and unreinforced stimuli 
recurred regularly in a fixed order. 
When the stimuli were suppressed 
the conditioned reflex reaction ap- 
peared at the same intervals, but in 
the places of the unreinforced stimu- 
lations the reflex was unstable. When 
the unreinforced stimulations were 
excluded from the pattern, the condi- 
tioned reflex to time was more stable. 
The conditioned reflex to “pure” 
time (the interval between stimuli 
being one minute) appeared toward 
the end of the third experiment, but 
was stabilized only after 450 repeti- 
tions. 

In a special series of experiments 
Bolotina made observations on 
change in the time interval. These 
experiments showed that the longer 
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the time interval was, the greater 
was the number of repetitions re- 
quired for stabilization of the condi- 
tioned reflex to time. On transition 
from one interval to another, the first 
two days showed a conditioned reflex 
to the former time interval, although 
even from the very beginning a con- 
ditioned reflex to the new interval 
was also in evidence. On the follow- 
ing days inhibition of both the old 
and the new conditioned reflexes was 
observed, and this the author ex- 
plains as the manifestation of an ori- 
entation reaction to the new arrange- 
ment of the experiment. From the 
third day intersignal reactions ap- 
peared in large number, but these 
were gradually controlled and be- 
came more and more concentrated 
toward the end of the new time inter- 
val. The rapid formation of condi- 
tioned reflexes to time in monkeys 
and their facile transition from one 
time interval to another are ex- 
plained, according to the author, by 
the great mobility of the nervous 
processes in these animals. 

That the conditioned reflex to time 
in monkeys is more stable than in 
other animals was indicated by the 
absence of any appreciable inhibitory 
effect from the operation of an ex- 
traneous stimulus (whistle) towards 
the end of the intersignal interval. 
During extinction of the conditioned 
reflex to time a large number of inter- 
signal reactions first appeared, and 
only later (after from 1 hr. 33 min. to 
2 hr. 6 min.) did complete extinction 
develop. Extinction took place in a 
wave-like manner. The extinguished 
reflex was easily re-established. In 
monkeys the conditioned reflex to 
time was disturbed by any pro- 
nounced change in the experimental 
setting, such as, for example, delay of 
the start of the experiment by 3-10 
minutes. 

In her own subsequent investiga- 
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tions O. P. Bolotina (1953) found 
that small doses of bromide (0.2- 
0.35 gm.) had a certain controlling 
effect on conditioned reflex activity, 
evidenced by. a reduction in the num- 
ber of intersignal reactions. Larger 
doses (2—3 gm.) exercised a sustained 
positive action on the reflex to time. 
A mixture of bromide and caffeine 
had a more definite, positive effect. 
Caffeine, in doses of 0.02—0.07 gm., 
produced an increase in the number 
of intersignal reactions in all the 
monkeys. 

According to the findings of O. P. 
Bolotina and A. A. Popova (1953), 
phenamine, in a dose of 0.3 mg./kg., 
led to slight disturbance of time reck- 
oning in the monkeys in the daytime, 
with increase in the number of inter- 
signal reactions, whereas during 
nighttime it rendered the conditioned 
reflex to time more exact. In a dose 


of 1.5 mg./kg., it caused considerable 
inhibition of reflexes to time during 
the day, but in the hours of night it 


intensified the reflexes and made 
them more exact. 

In the view of Voronin and his co- 
workers, the conditioned reflex to 
time in monkeys “‘. . . is formed on 
trace stimulations coming from the 
motor analyser (pressure on the 
lever) and on trace stimulations from 
the digestive tract, these being con- 
stantly combined in relation to iden- 
tical segments of time. These traces 
in the cerebral cortex are built up 
into a definite physiological state of 
the cortical cells. This state becomes 
a conditioned stimulus for the nerv- 
ous system which adjusts the entire 
organism to meet the material condi- 
tions of the surroundings” (Bolotina, 
1952b, p. 203). 

Thus, the formation of conditioned 
reflexes to time has been established 
for animals of quite a variety of 
classes (reptiles, birds, mammals), 
and among mammals, for representa- 
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tives of various divisions (insecti- 
vores, cheiropterids, rodents, carni- 
vores, artiodactyls, monkeys). If we 
may judge from the similarity of the 
process of formation of conditioned 
reflexes to time in the different ani- 
mals (the relative difficulty of their 
production in comparison with con- 
ditioned reflexes to other stimuli, 
their facile inhibition, their rapid ex- 
tinction in the absence of reinforce- 
ment, etc.), the nervous mechanism 
involved in the process is basically 
the same in all vertebrates. In the 
natural living conditions of many 
animals the formation of conditioned 
reflexes to time is of considerable im- 
portance in the development of a pe- 
riodicity of physiological functions, 
typical for the particular animals. 
Investigations on the conditioned re- 
flexes to time in various animals con- 
firm Pavlov’s view on the mechanism 
for time reckoning by the nervous 
system, and confirm that the power 
to reckon time is inherent in the corti- 
cal cells of all analysers, and that it is 
unnecessary to postulate the ex- 
istence of a special cortical! ‘time an- 
alyser,”” as has been suggested by 
several physiologists (P. M. Niki- 
forovskii and others). 

The results of investigations on 
conditioned reflexes to time in ani- 
mals may find wide application in 
practical animal husbandry, in rela- 
tion to the development and stabili- 
zation of rational regimes for the 
maintenance and economic exploita- 
tion of farm animals. In relation to 
periodically recurring processes (feed- 
ing, watering, moving cattle, driving 
out to pasture, milking, rest) the es- 
tablishment of conditioned reflexes to 
the times of these processes, that is, 
the establishment of a complex time 
pattern, is important, among other 
measures, in the management of ani- 
mals. The importance attaching to 
the development of such a pattern is 
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determined, not only by the fact that 
it instils order into and facilitates the 
work of man in caring for animals, 
but also because the positive physi- 
ological effect of the measures being 
pursued is considerably enhanced 
thereby. Time, as a unique exciter of 
reflex activity, is capable (in conjunc- 
tion with other stimuli) of creating 
and strengthening a definite course in 
physiological processes, of giving 
them a definite rhythm, a definite in- 
terrelationship in time. This un- 
doubtedly creates more favorable 
conditions for the course of each of 
the processes. It is for this reason 
that the observation of strict regu- 
larity in the feeding times of ‘animals 
increases the productivity of the 
food, that the strict maintenance of a 
regular milking routine increases the 
yield, and so on. 

In State farm and collective farm 
practice measures are taken to create 
a constant regime of animal main- 
tenance: times and routines for feed- 
ing are laid down (this is often done 
separately for different groups of ani- 
mals), and definite times are fixed 
for exercising the animals, for milk- 
ing cows, etc. Numerous examples of 
these m* .sures are given in the work 
of A. V. Kvasnitskii and V. A. 
Koniukhova (1954). In agreement 
with the results of laboratory in- 
vestigations on conditioned reflexes 
to time, their formation and con- 
solidation in practical animal farming 
occurs more rapidly when the condi- 
tioned reflex is established not to 
‘“‘pure’”’ time, but to a complex stim- 
ulus. In practical work also an at- 
tempt is made to combine the time 
of a particular productive process 
with some kinds of concomitant sig- 
nals. In some cases, for example, 
feeding time was linked with the 
rapid, striking of a rail, exercise time 
with Slow striking, and so on. Very 
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often such concomitant signals, 
strengthening the conditioned reflex 
to time that had been formed, were 
verbal stimuli (being, of course, only 
sound stimuli to the animals). Thus, 
the arrival of exercise time was linked 
with the word ‘‘walk,” the arrival of 
the milking time for each cow with 
the pronouncing of its name, etc. 

Change in the established regime, 
that is, a change from some earlier 
established conditioned time reflexes 
to others, presents a certain amount 
of difficulty for the nervous system, 
as can be judged from the results of 
laboratory investigations. The same 
has been observed in animal hus- 
bandry under natural conditions. 
A. V. Kvasnitskii and V. A. Koniu- 
khova (1954) illustrate this by the fol- 
lowing example. Special investiga- 
tions showed that in pigs under the 
usual feeding conditions (morning, 
noon and evening) the secretion of 
gastric juice is maxiinum towards the 
feeding times, and falls during the 
rest of the time, particularly during 
the night. When the feeding times 
were changed (evening, midnight and 
early morning) the previous condi- 
tioned reflex to time, that is, increase 
in the secretion of gastric juice at the 
times of the previotus day’s feeding, 
was preserved for several days, and 
only gradually was a reflex to the new 
feeding times established. Such a 
reconstruction of the time pattern 
entailed a period of disturbance in 
the functioning of the coordinated 
organs, and indeed :any disturbance 
of the established regime produced 
the same result. Kvasnitskii and 
Koniukhova cite as an example the 
reduction in the milk-yield by five 
per cent and in the fat content of the 
milk to 0.2—0.4 per cent that resulted 
from nulking 30-40 minutes later 
than the usual time. 

While we stress the importance of 
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the constant maintenance of a set 
daily routine, it should at the same 
time be noted that the most rational 
regime varies for different conditions. 
Consequently, in the course of ani- 
mal-rearing practice the question of a 
change of regime arises very fre- 
quently (in connection with the 
transfer of the animal from one pro- 
duction group to another, with 
change in season etc.). It is, of 
course, obvious that in these circum- 
stances the disruption of the old time 
pattern will be accompanied by a 
drop in the animals’ productivity on 
the first few occasions. In order, 
therefore, to facilitate this readjust- 
ment, this adaptation of the animals 
to the new conditions, it is essential 
that one should understand the laws 
governing the formation and recon- 
struction of conditioned reflexes to 
time in animals. 

In man the physiological mecha- 
nism involved in the reckoning of 
time by the nervous system has not 
been adequately studied. The fact 
itself of the perception of time by 
man has, of course, been known for a 
long time. The perception of time by 
man and the various changes that 
occur in this perception under both 
normal and pathological conditions 
have long been the object of investi- 
gation by many psychologists, psy- 
chiatrists and neurologists. It was, 
however, only the doctrine developed 
by Pavlov on higher nervous activity 
that made it possible to link the in- 
vestigation of such matters with 
physiological investigations on the 
functions of the cerebral cortex, with 
investigations on conditioned reflex 
activity, and particularly conditioned 
reflexes to time in man. 

A number of works by lu. P. 
Frolov (1924, 1928, 1951) have been 
devoted to an analysis of conditioned 
time reflexes in man. Believing, in 
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agreement with the views of Pavlov, 
that ‘. . . dynamic changes in nerve 
traces, as the ‘differentials’ in the 
reckoning, constitute the funda- 
mental element in the reckoning of 
time’ (1951, p. 837), Frolov pointed 
out that the speed with which traces 
from stimuli are extinguished may 
vary with the state of the cortical 
cells, and that various errors in the 
assessment of time are, in his opinion, 
connected therewith. 

According to his findings, a tend- 
ency to over-assess time (within lim- 
its of five-eight per cent) was con- 
stantly observed in experiments on 
healthy individuals of various ages 
and occupations. The author ex- 
plained this by the development of 
internal inhibition in the subjects 
under the experimental conditions 
(darkness, silence), in consequence 
of which the trace from the signal- 
stimulus, from which the reckoning 
of time began, was damped down 
more rapidly than usual. In other 
words, trace processes reached a cer- 
tain degree of intensity earlier than 
they should, and therefore the true 
interval of time seemed longer to the 
subject (‘‘time was extended’”’). The 
same was observed, according to the 
author, in the case of anticipation. 
When the excitability of the cortical 
cells was heightened, on the other 
hand, time was under-assessed. 

Frolov described pronounced dis- 
turbances of time reckoning in vari- 
ous psychical illnesses (over-assess- 
ment of time in depression, Kor- 
sakov’s psychosis, etc.), and associ- 
ated these disturbances with the de- 
velopment of an inhibitory process in 
the cerebral cortex, and the more 
rapid suppression of traces from the 
stimuli. 

“From a practical point of view it 
is important to note,” the author con- 
cluded, ‘‘that all changes in the state 
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of the subjects’ health, the degree of 
their adaptation to work, and also the 
presence of emotional elation or de- 
pression were immediately reflected 
in the accuracy of their time reckon- 
ing, inducing acceleration or slowing 
in the cxlculation, over-assessment or 
under-assessment of the passage of 
time’ (1951, p. 838). 

During the establishment of a 
rhythmical pattern, the formation of 
conditioned reflexes to time in man 
was noted and examined in connec- 
tion particularly with the analysis of 
one of the phenomena observed dur- 
ing the process, and first described by 
V. M. Bekhterev (1908). 

Examining the motor reactions of 
man to rhythmically recurring 
sounds, Bekhterev observed that, 
after sudden cessation of the sound, 
the subject continued to execute sev- 
eral movements at the former rate. 
Also, as the period over which the 
sounds were in operation increased, 
and as the rate at which they fol- 
lowed one another was raised, so the 
number of motor reactions executed 
after the cessation of the sound in- 
creased. Subsequently, G. P. Zelenyi 
(1923) made a special study of this 
phenomenon by setting the subject to 
beat in time with a metronome (120 
beats per minute). When the metro- 
nome was stopped (after 120 beats) 
the subject made several additional 
movements at the same rate, and this 
occurred even when the subject was 
asked not to make any excess move- 
ments. The author also observed 
that when, after an interruption in 
the rhythmical work, the metronome 
gave one beat, the subject made two 
or several movements in succession 
with the same intervals between them 
as between the beats of the metro- 
nome. In a joint work with B. N. 
Kadykov, G. P. Zelenyi (1937) found 
that the motor reactions very often 
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outstripped the conditioned signals, 
and developed before the next suc- 
ceeding beat of the metronome. Ac- 
cording to the observations of K. M. 
Bykov (1925), the rhythm of the 
movements does not always coincide 
with the rhythm of the metronome; 
some subjects offer slower move- 
ments, some more rapid, and some 
again make movements at an irregu- 
lar rate, while in only 49 per cent of 
the subjects was the rate of the move- 
ments coincident with the beats of 
the metronome. According to By- 
kov’s findings, the number of excess 
movements after cessation of the 
metronome did not exceed three in 
the majority of subjects (76 per cent), 
although in some cases it reached 
nine. Both Bykov and Zelinyi regard 
these ‘‘excess’” movements as the re- 
sult of the development of a condi- 
tioned reaction to time. 

Detailed investigations of the same — 
phenomenon were carried out re- 
cently by M. A. Alekseev (1953) 
with various rates of movements 
(from 20-30 to 200 per minute, that 
is with intervals between successive 
conditioned signals of from 0.3 to 
two-three seconds). His observations 
showed that supplementary motor re- 
actions after cessation of the rhyth- 
mical signals only developed when 
the intervals between the conditioned 
signals were less than one second. Re- 
garding these reactions as a mani- 
festation of a conditioned reaction to 
time, the author also points to other 
forms of this phenomenon (shorten- 
ing of the latent period of the motor 
reactions, frequent anticipation of 
the next signal). With intervals ex- 


ceeding one second (1.6—3 seconds) 
no supplementary movements ap- 
peared as a rule, that is, the condi- 
tioned reaction to time was inhibited. 
In these cases a sharp fall in cortical 
excitability was seen immediately 
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after the conditioned signal, to be re- 
placed by heightened excitability as 
the moment for the action of the next 
signal approached. This the author 
regarded as a “‘latent’’ manifestation 
of the conditioned reaction to time. 
Assuming that the development of a 
conditioned reaction to time is de- 
termined by the degree of concentra- 
tion of the conditioned excitation in 
the period leading up to the next rein- 
forcement, the author explains the 
differences in the formation of this 
reaction at different intervals as fol- 
lows: in the presence of large intervals 
following negative induction, excita- 
bility increases slowly, while in the 
presence of short intervals it rises 
steeply. Since between the condi- 
tioned signal (beat of the metronome) 
and the complex stimulus which in- 
duces the conditioned reaction to 
time, there occur, according to the 
author, active induction relation- 
ships, it follows that in the case of 
longer intervals (as a result of the 
slow increase in excitability), the ex- 
ternal signal proves stronger, and the 
reaction to time isinhibited. Whereas 
with short intervals (in consequence 
of the steeper rise in cortical excita- 
bility) ‘‘ . . . this state of the cortical 
cells emerges as an independent con- 
ditioned stimulus, inhibiting the re- 
action to the sound and inducing the 
motor reaction” (Alekseev, 1953, p. 
896). 

The formation of conditioned re- 
flexes to time has been noted also in 
the course of investigations on the 
daily periodicity in the physiological 
functions of man. Thus, investiga- 
tions of daily variations in the com- 
position of the blood in man have 
shown that in many people the num- 
ber of leucocytes increases at the 
usual hours of eating (Voronov & 
Riskin, 1925; Orlova, 1937), and that 
this increase is the result of the for- 
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mation of a conditioned reflex to the 
time of taking food (Belen’kii, 1949). 
A. G. Urin and E. S. Zenkevich 
(1952) stated that not less than six— 
seven days were required for the for- 
mation of a conditioned leucocytic re- 
flex to the time of taking food. When 
the time of eating was changed, the 
conditioned leucocytic reflex was ex- 
tinguished after two-four days, and 
the longer the conditioned Teflex had 
taken to become established, the 
slower was its extinction. The extin- 
guished reflex was restored rapidly on 
return to the previous conditions of 
eating. 

The features attaching to the for- 
mation of a conditioned reaction to 
time in man were investigated in our 
laboratory by A. S. Dmitriev. The 
observations were made on children 
aged 8 to 14 years. Conditioned 


motor reactions were established in 
the subjects to sounds with verbal 
reinforcement, the conditioned stim- 
ulus being repeated at equal inter- 


vals of time (25-30 seconds). 

In many of the children the forma- 
tion of a conditioned reaction to time 
began with the appearance of inter- 
signal motor reactions (after 5-13 
repetitions), and indeed these reac- 
tions appeared also at times later 
(every five-six repetitions or even 
more frequently) at quite varied 
times in the intersignal interval. The 
character of the distribution of all the 
intersignal reactions in one of the 
subjects in the first experiment is 
shown by the dotted line curve in 
Fig. 3. In subsequent experiments, 
however, the number of intersignal 
reactions diminished, and appeared 
mainly in the second half of the inter- 
signal interval (as is shown by the 
continuous line curve in Fig. 3). Fi- 
nally, a more or less differentiated 
conditioned reaction to time was 
formed, and the intersignal reactions 
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STAGES OF FORMATION OF THE MOTOR REACTION TO TIME IN CHILDREN (SuBJECT T.K.). 

The dotted line shows the distribution of the total number of intersignal reactions in the 
25-second intersignal interval seen in experiment No. 1, and the solid line shows the distribution 
of the total number of intersignal reactions in experiment No. 2. 


appeared two-three seconds before 
the next conditioned signal; when the 
next signal was omitted, reactions 
often developed at the time when it 
was due. Thus, the process of forma- 
tion of a conditioned reaction to time 
passes through the same stages in 
children as we observed during the 
formation of a motor-defence condi- 
tioned reflex to time in dogs (cf. Fig. 
1 and 3). The duration of the stages 
and their relationships to one another 
differed in different children, as the 
graphical illustrations in Fig. 4 show. 
The total time required for the for- 
mation of a differentiated condi- 
tioned reaction to time in children 
varied from 29 to 82 repetitions. 

It must be noted, however, that the 
formation of a conditioned reaction 


to time in this form was noted pre- 
dominantly in children of early school 
age. But some of these intersignal re- 
actions appeared in small numbers 
even after a large number of repeti- 
tions, a fact which prevented exact 
observation of all the stages in the 
formation of a conditioned reflex to 
time. In children of middle school 
age the conditioned reaction to time 
appeared in the form of intersignal 
reactions in only 42.8 per cent of all 
cases; in most no intersignal reac- 
tions whatever were seen. It was, 
nevertheless, possible to establish a 
conditioned linkage to time in this 
group of children, but by a different 
method: being told (after four—five 
experiments) to make independent 
pressures at the same intervals as in 
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the preceding experiments, they pre- 
served with a fair degree of accuracy 
the usual intersignal interval be- 
tween the pressures, which was possi- 
ble only in the presence of a more or 
less exact reflection of this interval in 
the child’s cerebral cortex. The older 
the children, the greater was the ac- 
curacy with which this task was car- 
ried out. 

These investigations enabled us to 
conclude that, in the case of children 
of early school age, first signal system 
associations occupy a _ prominent 
place in the formation of the condi- 
tioned reaction to time, while in the 
case of children of middle school age 
second signal system connections 
play the leading part in the forma- 
tion of the conditioned time reaction, 
inhibiting through negative induc- 
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tion the appearance of the condi- 
tioned reaction within the limits of 
the first signal system in the form of 
intersignal motor reactions. This 
feature in the formation of condi- 
tioned reactions to time should, 
obviously, be more pronounced in 
adults. 

There is no doubt that investiga- 
tions of conditioned reactions to time 
in man are important for the solution 
of a number of practical problems. 
Such problems include the establish- 
ment and stabilization of a rational 
regime of work and rest, the arrange- 
ment of the school-day for children, 
the rhythm of industrial processes, 
nutritional regimes, etc. Investiga- 
tions of conditioned reactions to time 
in man will lead to discovery of the 
physiological mechanism involved in 
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the perception and measurement of 
time and its features in individuals 
of different ages, and such informa- 
tion might be used to advantage in 
training and education. Thus, fur- 
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ther study of the conditioned reac- 
tions to time in man is closely bound 
up with the solution of important 
practical problems in medicine, psy- 
chology, and education. 
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There is one aspect of the general 
problem of the effect of high intensity 
noise on man that has been insuffi- 
ciently explored; this concerns the 
question of the effects of irregular or 
interrupted sound on performance, 
feeling, and physiology. Evidence 
which has been accumulating during 
the past few years suggests that such 
intermittent sound has both different 
and more disturbing effects on Ss 
than those of steady sound sources. 
Such differences may, in addition, 
have important theoretical implica- 
tions for the understanding of brain 
mechanisms. 

This report will, therefore, attempt 
a review and analysis of the literature 
dealing with the problem of the ef- 
fects of interrupted sound on various 
measures of human functioning, par- 
ticularly as related to perceptual- 
motor activity, subjective report, and 
body physiology. 

In 1946, Berrien published a re- 
view of the effects of noise as applied 
mainly to industrial environments 
and reported that, although much of 
the evidence was inconclusive, there 
were some indications that 
tended to affect work output and 
speed of work. This review noted 
that there are marked individual dif- 
ferences in susceptibility to the ill- 
effects of noise. No attempt to dis- 
tinguish the effects of intermittent 
from steady noise was made except to 
note the increased unpleasantness of 
intermittent sounds. 


noise 


1 Supported by Contract Nonr-2252(01) 
with the Office of Naval Research. Reproduc- 
tion for any purpose of the United States 
Government is permitted. 


Kryter in 1950 published an ex- 
tensive review of the literature of the 
effects of noise on men. His conclu- 
sions from studies conducted under 
laboratory conditions indicated that 
experimental studies can be grouped 
under three categories: 

1. Experiments which demon- 
strated deleterious effects of noise. 
Nearly all, if not all, can be heavily 
criticized on one or more points so 
that findings can be accepted only 
with considerable reservations. 

2. Experiments which demon- 
strated slight, inconsistent or incon- 
clusive detrimental effects from noise. 

3. Experiments that demonstrated 
that man can do certain types of 
muscular and mental work as efh- 
ciently and productively in noise as 
in quiet, even for prolonged periods. 
For some few tasks, noise apparently 
improved performance. 

However, it is obvious that previ- 
ous studies have not sampled all the 
different types of behaviors of which 
man is capable. Kryter also notes 
that different kinds of sounds may 
have marked effects on feelings as 
well as on certain physiological meas- 
ures. These will be described in later 
sections of this review. 

Since Kryter’s report, a number of 
studies have appeared which indicate 
that noise can affect performance. 

An experiment by Miller (1953) 
tested the effects of a 90 db,? 8000 cps 
tone on the four measures of critical 
flicker fusion, cancelling c’s, word 


? All designations of intensity in this report 
will be decibels with reference to 0.0002 
dynes/cm. sq. =0 db unless otherwise noted. 
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fluency, and trembling. Only the last 
measure showed any effect (i.e., in- 
crease in trembling) as a result of the 
sound. In the Benox report, Miles 
(1953) indicated that Ss exposed toa 
115 db sound for three hour periods 
showed an impaired performance on 
Coordinated Serial Reaction Time— 
i.e., on a task in which the S directed 
a beam of light at a series of targets 
by means of airplane controls. This 
was reported as being the most effec- 
tive of all the tests for showing an 
effect of noise on performance. 
Broadbent (1951a) described an ex- 
periment in which the S was required 
to observe and correct any pointer 
that exceeded the danger mark on a 
series of steam pressure gauges. The 
pointer signals were presented at ran- 
dom in a series of test runs each 90 
minutes long. Ten Ss participated, 


each for five days, and were exposed 
in the sequence: quiet, quiet, noise, 
noise, quiet. 


Noise levels for quiet 
were 70 db, and noise 100 db. The 
results indicated that Ss performed 
more poorly on the noise days than 
on the quiet days preceding and fol- 
lowing the noise sessions. When the 
task was made simpler (the dials 
were replaced by lights whose bril- 
liance was sharply intensified at ran- 
dom as a signal) another group of Ss 
showed no significant effect of noise. 

Other studies by Broadbent (1951b, 
1953, 1954) were designed to investi- 
gate the effects of pacing on the per- 
formance of a vigilance task under 
continuous 100 db noise conditions. 
More errors were made on the task 
under noise conditions than under 
control conditions in each experi- 
ment. However, the results of the 
pacing were inconsistent in two 
studies. In one study it was found 
that fewer errors were made by the 
group under unpaced conditions 
(stimulus presented after every re- 
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sponse) than by the group under 
paced conditions (stimulus presented 
every second). In the other study no 
difference was found between paced 
and unpaced performance. Unlike 
the first study, however, in the latter 
task, performance was _ prolonged 
over a one and one-half hour period 
suggesting that the Ss may have 
adapted to the pacing and noise con- 
ditions. More errors were made un- 
der the noise condition. 

Using a modification of the above 
procedure, Jerison and Wing (1957) 
had Ss watch three clocks side by 
side whose pointers made double 
jumps on a random basis of about 
one per minute. The Ss watched the 
clocks during two hour sessions and 
pressed a key when each double jump 
occurred. Comparing a ‘‘quiet’’ con- 
dition of 83 db with a noise condition 
of 114 db produced by a loudspeaker 
with frequencies of 20 to 9600 cps, 
they found that. performance de- 
creased significantly in the final half- 
hour of the noise condition. No 
change in performance occurred dur- 
ing the two-hour ‘‘quiet’’ condition. 

In another clock-watching experi- 
ment Jerison (1956) reported a 
definite impairment of performance 
with increased time in the task situa- 
tion, and concluded that this was due 
partly to noise, partly to fatigue, and 
partly to the unreliability (test-re- 
test) of the task itself. An important, 
but unexpected observation was that 
Ss who showed marked auditory fa- 
tigue after the test tended to perform 
at the same level in noise as in quiet, 
whereas those Ss with only mild hear- 
ing losses tended to fall off from their 
starting performance level. This find- 
ing emphasizes the important fact of 
individual differences. 

Another effect of noise that has 
very recently been discovered is the 
fact that time estimation is affected 
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by noise levels. Hirsh, Bilger, and 
Deatherage (1956) have reported 
that estimates of durations up to 16 
seconds were systematically distorted 
if the noise levels during the presen- 
tation of the interval and during the 
Ss’ attempt at reproducing that in- 
terval were different. The change 
was in the direction of overestimat- 
ing the interval when the noise level 
was higher during the reproduction 
period and underestimating the inter- 
val when the reverse was the case. 
Jerison and his co-workers (Jeri- 
son, Crannell, & Pownall, 1957) also 
found an effect of noise on time esti- 
mation. Two hundred Ss, working in- 
dividually, were required to follow a 
moving target visually and to imag- 
ine the continuing movement of the 
target after it disappeared. When the 
target was believed to have reached a 
crosshair, the S squeezed a trigger. 
Noise of about 110 db (frequency 
range: 20—10,000 cps) was introduced 
at certain times. It was found that a 
noise program in which it was quiet 
when the target disappeared gave 
longer judgment times relative to 
those obtained under control condi- 
tions of quiet or noise throughout. 
The opposite program of ‘‘noise then 
quiet’”” was not differentiated from 
the control conditions. It was also 
found that judgment times became 
longer in succeeding trials under all 
four noise conditions. An effect of 
noise on subjective time judgments 
has also been noted by Loeb (1957). 
In addition to these recent studies 
that indicate an effect of noise on 
certain kinds of performance, there 
have been a number of observations 
on the effects of intermittent noise on 
performance. Many years ago Cassel 
and Dallenbach (1918) found that an 
intermittent noise resisted habitua- 
tion more than a continuous one. 
Similarly, Laird (1933) reported that 


135 


Ss who were required to put a stylus 
through small irregularly spaced and 
sized holes in a moving tape, showed 
that the drop in output was greater 
when the noise distractor was made 
to vary in intensity in an eight- 
second cycle than when its loudest 
component was presented as a steady 
sound. 

In two recent studies, K. R. Smith 
(1950, 1951) compared the per- 
formance of two groups of college 
students on number checking, name 
checking and form board tests. The 
experimental group was exposed to a 
series of 100 db intermittent sound 
bursts of 10 to 50 seconds in length, 
but so arranged that the total noise 
time was equal to the total silence 
time during the 30-minute period of 
the tests. It was found that the ex- 
perimental group tried more items, 
scored more correctly, scored more 
incorrectly, and was less accurate 
than the control group. Some of the 


differences were small but significant. 
One other interesting fact emerged: 


with two exceptions, the experi- 
mental group was more homogeneous 
(i.e., smaller SDs) than the control 
group. 

Another experiment which reports 
an effect of intermittent noise on per- 
formance is that of Corso (1952). He 
used a 100 db, 100-3000 cps noise, 
which was introduced intermittently 
throughout the test period while the 
Ss worked on the Minnesota Clerical 
Test and the Minnesota Form Board 
Test. Here, just as in the previously 
cited experiment by K. R. Smith, the 
Ss attempted more test items, got 
more correct, and made more errors, 
even though their performance was 
reported as less variable under the 
noise than under the control condi- 
tions. (Those who had performed 
well on the tests under control condi- 
tions tended to do poorer under the 
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stress, while on the average, those 
who had performed poorly in the 
control session improved their scores 
under stress.) 

In summary then, it is possible to 
show that recent research indicates 
that high intensity noise has an effect 
on certain types of complex tasks, 
and that intermittent noise, in the few 
cases where it has been studied, seems 
to have a greater tendency to impair 
performance than steady noise. 

On the basis of this past research, 
three hypotheses are offered: 

1. Performance impairments are 
more likely to occur during high in- 
tensity intermittent sound than un- 
der lower intensity or steady sound. 

2. The relative difficulty of »the 
task is an important variable deter- 
mining the effect of noise on per- 
formance. The more difficult the 
task, the more likely will noise be 
disruptive. 

3. Individual differences in reac- 
tion to noise are extremely important 
to consider in evaluating research. 
There is evidence to indicate (Plut- 
chik, 1955) that the mean of a group’s 
responses may be misleading if there 
are marked individual differences or 
subgroups within the larger group. It 
is hoped that these hypotheses will 
be systematically tested in subse- 
quent research. 

One other major question is in- 
volved in a study of the effects of 
noise on performance. It is evident 
that a high intensity noise of what- 
ever form can be considered a stress 
stimulus, and the literature dealing 
with stress should have relevance to 
noise and performance studies. A 
recent review and critique of studies 
of stress (Harris, Mackie, & Wilson, 
1956) presents the following findings 
and conclusions: 

1. There are wide individual dif- 
ferences in reactions to stress. The 
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reasons for these different reactions 
have not been clearly identified. 

2. The majority of the studies 
have been concerned with the effects 
of relatively short-term stress condi- 
tions, which means that temporary 
compensatory performance is possi- 
ble. In many cases, the durations of 
the stress stimuli have not been re- 
ported. 

3. Apparently no one has syste- 
matically investigated the relation 
between the intensity of the stress 
stimulus and its effect on behavior. 

4. Few investigators have at- 
tempted to observe the course of be- 
havior during stress. 

5. Of the many different experi- 
mental designs which have been used, 
it is suggested that the subject- 
control design which compares a per- 
son’s performance under both stress 
and non-stress conditions, is more 
satisfactory than a random group or 
group-control design method. 

In the light of these observations it 
is evident that much further research 
is needed to clarify inconsistencies 
and to develop a theoretical rationale 
for the results which have been re- 
ported. 


THE EFFECTS OF INTERMITTENT 
SOUND ON FEELINGS 


There are at least three aspects to 
this problem that require examina- 
tion: (a) the nature of the feelings 
that are associated with high in- 
tensity noise; (b) the effect of such 
noise on threshold, adaptation, and 
auditory acuity; and (c) the special 
subjective characteristics of inter- 
mittent, repetitive, or pulsed sounds. 

With regard to the first aspect, the 
literature is almost unanimous: high 
intensity noise, even when it may 
have no effect on performance, will 
generally produce symptoms of dis- 
comfort, irritability, and distraction. 
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To illustrate: In the Benox report 
(Miles, 1953) a steady noise of 115 db 
heard for three-hour periods pro- 
duced fatigue and discomfort; Miller 
(1951) had his Ss listen to 111 db 
noise for 30-minute periods and they 
reported irritation, distraction, and 
general disturbance. Blau (1951) re- 
ports that a high intensity noise 
source of 103 db which accompanied 
the administration of various tests 
had practically no effect except to 
tend to arouse “somatic complaints 
of specific anatomical location and 
description,’ and Mendelson and 
Conway (1947) who exposed 10 
volunteers to jet engine noise for 14 
days and a total of 19 hours (at about 
120 db over the range of 20-8000 
cps), found that seven of the 10 Ss 
reported fatigue, irritability, and 
nervousness. 

In spite of this agreement on the 
subjective effects of high intensity 
noise, there have been very few 
studies designed to isolate the more 
disturbing aspects of the noise spec- 
trum. One of the few dealing with 
this problem studied the relative an- 
noyance produced by various bands 
of noise (Reese & Kryter, 1944). The 
authors used filter systems to divide 
‘“‘white”’ noise into several bands, and 
five Ss were asked to adjust each 
band of noise to equal a standard 
band (of 1900 to 2450 cps at 94 db) 
in “annoyance”’ value. It was found 
that frequencies above 2000 cps were 
more annoying than those below it. 
The results showed that ‘annoy- 
ance” as a characteristic of sound is 
discriminable from loudness, al- 
though with continued testing the 
annoyance and loudness contours be- 
came less separated. 

One other early study dealing with 
this problem (Laird & Coye, 1929) 
had 14 Ss compare the relative an- 
noyance of eight different frequencies 
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ranging from 64 to 8192 cps, by the 
method of paired comparisons. A 
U-shaped curve resulted with fre- 
quencies of 256 to 1024 least annoy- 
ing, with a marked rise in relative 
annoyance for lower and higher fre- 
quencies. The authors conclude that 
at low intensities annoyance is ap- 
proximately proportional to loudness 
while at high intensities (over about 
80 db), the frequencies below 500 cps 
follow the equal loudness curves, 
while frequencies over 500 are equally 
annoying at lower loudness levels. 
Thus, annoyance is different from 
loudness per se. 

Kryter summarized the findings 
dealing with the annoyance value of 
various noises by citing the aspects of 
a sound which tend to affect annoy- 
ance value. These are: (@) unexpect- 
edness, (6) inappropriateness, (al- 
though this is a term which is diffi- 
cult to specify), (c) intermittency 
(irregular, variable sounds are more 
annoying than steady ones), (d) re- 
verberation (lack of localizing in- 
creases annoyance), (e) loudness (the 
threshold at which any sound be- 
comes annoying has not yet been un- 
equivocally determined), (f) fre- 
quency pattern (sounds having their 
energy concentrated in the higher 
audible frequencies are more an- 
noying). 

So far as it has been possible to de- 
termine, the relative annoyance 
value of different kinds of intermit- 
tency has not been studied. 


FATIGUE EFFECTS OF HIGH 
INTENSITY SOUND 


In spite of many reports on the 
after effects of high intensity noise, 


including the extensive study by 
Davis (1942) during World War II 
using intensities up to 130 db and 
durations as long as 64 minutes, the 
problem has not yet been entirely 
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clarified. In general, it is known that 
in auditory fatigue, duration of stim- 
ulation has a cumulative effect from 
30 seconds up to at least 10 minutes 
(Hallpike & Hood, 1941; Harris, 
1953); that the maximum fatigue ef- 
fects of a given frequency may be a 
half octave higher (Davis et al., 
1943); and that there are very 
marked individual differences in sus- 
ceptibility to auditory fatigue (Har- 
ris: 1953, 1954; Wilson, 1950). The 
existence of marked individual dif- 
ferences are attested by the fact that 
the least susceptible Ss may return to 
normal in less than seven minutes 
from stimulation which the most 
susceptible do not recover from in 
more than 24 hours (Harris, 1953). 
Hearing loss, according to Harris 
(1953) tends to be a linear function 
of both stimulus duration and stim- 
ulus intensity, but for fatigue-re- 
sistant individuals recovery from 10 
minutes of exposure to noise levels of 
120 db takes only about seven min- 
utes. Hearing loss for relatively 
short exposures, seems to be com- 
pletely reversible. For example, a 
recent study (Thwing, 1956) re- 
ported that the adaptation produced 
by a 70 db tone of 1000 cps presented 
for six minutes is followed by com- 
plete recovery in about one minute 
after the termination of the adapting 
stimulus. 

A test for screening fatigue-sus- 
ceptible individuals has been re- 
ported in the literature by Wilson 
(1950). He exposes the S to a 2048 
cps tone at 80 db intensity for five 
minutes, and threshold shifts great 
enough to prevent the S from hearing 
a tone 10 db over his own threshold 
after two minutes recovery are used 
as a criterion of fatigue. Harris 
(1954) suggests that a better criter- 
ion is either the time in seconds to re- 
turn to within 5 db of own threshold, 
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or the residual hearing loss after one 
minute. In both studies, the test 
tone is presented at 4096 cps since 
this is the region most affected by 
stimulation at 2048 cps. It is of some 
interest to note, in this connection, 
that when white noise, with a fre- 
quency band over most of the audi- 
tory spectrum, was used at 100-120 
db for several minutes, the frequency 
of maximum fatigue, for all observ- 
ers, was reported at between 4000 
and 8000 cps (Hirsh & Ward, 1952). 

Another more recent test of noise 
susceptibility has been proposed by 
Jerger and Carhart (1956) on the as- 
sumption that an ear’s tendency to 
develop a permanent hearing loss is 
related to that ear’s reaction to tem- 
porary acoustic stress. They found 
that out of 178 Air Force jet- 
mechanic trainees only 15 (or eight 
per cent) showed a hearing loss of 10 
db or more on audimeter tests in the 
3000-4000 cps range, eight weeks 
after a three day (12 hour) exposure 
period to jet-engine noise (SPL level 
unspecified, but probably in the 
range of 120-140 db). There was a 
slight but significant positive correla- 
tion between the amount of tem- 
porary threshold decrease to a 3000 
cps, one minute duration tone, at 100 
db, and the hearing loss found after 
eight weeks. 

Several relatively recent studies 
have attempted to determine the ef- 
fects of jet-engine noise on hearing 
loss. Mendelson and Conway (1947) 
used 10 Ss exposed to sound inten- 
sities of about 120 db for a total of 19 
hours, and reported that hearing 
losses of 20 to 60 db at frequencies of 
512 to 2048 cps vanished after a 
week-end’s rest and did not return. 
Davis et al. (1953) exposed 17 men to 
21 bursts of jet-engine noise (at 126— 
150 db) each lasting 15 seconds, and 
separated by 30-45 second intervals 
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of relative silence. Audiograms taken 
two to three hours after exposure 
showed no temporary loss in any S, 
although the authors point out that 
people who come in constant contact 
with jets show definite hearing losses. 
Eldred and his co-workers (Eldred, 
Gannon, & Gierke, 1955) found that 
a one minute exposure to jet noise at 
130-135 db produced a slight hearing 
loss followed by complete recovery 
within eight hours. 

In the Benox report (Miles, 1953) a 
study is cited by Silverman (1947) in 
which he was able to locate the pain 
threshold for both normal and hard 
of hearing ears at about 140 db using 
earphones. The sensation of pain 
(“it hurts’’) was distinguished clearly 
from sensations of auditory discom- 
fort (“it is too loud’’), and of touch 
(‘‘it tickles,’’ or ‘‘I feel something in 
the ear’’). 

The Benox researchers compared 
the pain threshold in a free field, us- 
ing jet engines as noise source, with 
pain threshold determined by ear- 
phones and concluded that they were 
the same. They also noted that in 
the frequency range between 800 and 
2000 cps the sound became ‘‘uncom- 
fortably loud’’ at sound levels well 
below the pain threshold. This latter 
finding is consistent with the work of 
Hardy (1952) who points out that 
certain sounds, even 85 db, if pro- 
longed over months and years may 
cause some degree of deafness. He 
concludes that sounds which exceed 
100 sones per octave band are prob- 
ably damaging with long-time daily 
exposure but that no damage is ex- 
pected if no octave band exceeds 50 
sones. 

It may be concluded from these 
various observations and reports that 
the short time exposures to high in- 
tensity noise levels in the laboratory 
are not likely to produce any kind of 
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permanent hearing losses, and that 
the effects which do appear are very 
transient, although there may be 
some marked individual differences 
in the speed of recovery. 


SoME SUBJECTIVE CHARACTERISTICS 
OF REPETITIVE SOUNDS 


When pure tones are presented 
repetitively for very brief durations, 
then at least two important phenom- 
ena come into existence. If the dura- 
tion of each tone is extremely short, 
the tone is heard as a click which 
does not have a discernible fre- 
quency. As the duration is gradually 
increased a “‘click-pitch’’ threshold 
is found (i.e.) the shortest duration 
of a tone which allows the tone to 
have some pitch character to it), and 
then a ‘“‘tone-pitch’ threshold is 
found (the shortest duration of a 
tone in which the tonal character, 
rather than the click character is 
dominant). The exact values of 
these thresholds decrease with fre- 
quency, within the range of 125-8000 
cps, but in no case is it greater than 
18 milliseconds for the click-pitch 
threshold, or 25 milliseconds for tone- 
pitch (Doughty & Garner, 1947). 

The second phenomenon referred 
to is a result of the abrupt presenta- 
tion or removal of a tone. If a pure 
tone is interrupted at a given fre- 
quency, not only is the original tone 
obtained but also additional fre- 
quencies which are a function of the 
rate of interruption. A square wave 
pulse modulation produces a spec- 
trum of sideband frequencies with 
maximum energy in the central com- 
ponent. Changing the repetitive 
rate or the duration of a_ pulse 
changes the spectrum of energy. Ac- 
cording to Garner (1947b) energy 
changes at the rate of 6 db in the 
central component for every doubling 
or halving of either the repetition 
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rate or duration; each doubling or 
halving of the repetition rate or dura- 
tion changes the total energy by 3 db. 
The threshold response is primarily 
determined by the duration of the 
pulse, and not the repetition rate, for 
low rates. This spectrum of sideband 
frequencies adds a click to any pure 
tone that may be used. 

Although some studies (Garner: 
1948, 1949) have made no attempt to 
eliminate or deal with the click intro- 
duced by the abrupt onset or de- 
crease of a tone, two remedial pro- 
cedures have been mentioned in the 
literature. One way to suppress the 
click due to switching the tone on and 
off is to produce a gradual increase 
and decrease of the tone, rather than 
to use a square wave (Luscher & 
Zwislocki, 1949); Munson (1947) 


used a three millisecond rise and fall 
time for this purpose while Miller 
and Heise (1950) used a 20 millisec- 
ond rise and fall time. Another way 
of dealing with this problem is to use 


a wide-band noise source with a uni- 
form spectrum (‘‘white-noise’’) as the 
interrupted tone because the side- 
bands produced by the interruptions 
fall within the existing spectrum and 
do not produce a change in the sub- 
jective character of the tones. This 
procedure was used by Pollack (1941) 
and by Miller and Taylor (1948). 

Interestingly enough, the early 
study by Shower and Biddulph 
(1931) compared the effect on rela- 
tive discrimination of the transients 
introduced by abrupt switching on or 
off of a tone. They found that the 
only effect was a decrease in the 
Weber ratio for frequencies below 500 
cps. This indicates that the effect, 
if any, of a click cannot be assumed 
to necessarily disrupt or inhibit some 
auditory function, without investiga- 
tion. 
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When tones are presented repeti- 
tively in brief pulses, there are sev- 
eral parameters that need to be speci- 
fied in addition to the intensities and 
frequencies used. There is the dura- 
tion of the tone, the repetition rate, 
the percentage of the total time that 
the tone is on, which represents an 
on-off ratio, and the total energy in 
decibels (Garner, 1948). Miller 
(1948) has called the on-off ratio by 
another name, the sound-time frac- 
tion, in a study of auditory ‘“‘flutter’’ 
In his study he reports that in general 
the same types of relations for audi- 
tory ‘‘flutter’’ hold as do those for 
visual flicker with the exception that 
the critical flutter frequencies (135 
bursts per second for one S and 270 
bursts per second for the other) are 
much higher for fusion than are vis- 
ual flicker rates. 

In an extension of this work on 
auditory flutter, Pollack (1952) used 
flutter rates from 0.4 to 200 bursts 
per second, with five Ss. When the 
relative change in flutter rate (that 
is, Af/f) is plotted against flutter fre- 
quency, a minimum is found in the 
region of 10 per second. Pollack 
hypothesizes that this is not unre- 
lated to the fact that the alpha 
rhythm of the brain is also about 10 
per second. 

Several interesting studies have 
been reported dealing with the ability 
to count short repetitive pulses, and 
with the loudness of pulses in com- 
parison to steady tones. Taubman 
(1944), for example, had his Ss judge 
the number of dots that would be 
sounded while different numbers of 
dots, from one to six were presented 
at either 10 per second, 14 per second 
or 18 per second rates. He found rela- 
tively little difference in the number 
of dots judged at these three different 
rates, although knowledge of results 
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seemed to help. His data suggest 
that these three rates cannot be 
easily discriminated. 

Consistent with this finding is the 
work of Cheatham and White (1954) 
and Garner (1951). Cheatham and 
White used a 1000-cycle tone pre- 
sented in pulses of 10 per second, 15 
per second, and 30 per second, with 
each pulse of 11-millisecond duration 
and 70 db intensity. For four Ss, it 
was found that regardless of the ob- 
jective rate used, the subjective rate 
for auditory perception of sound 
pulses approaches a limit of about 9 
to 11 pulses per second. In addition 
it was noted that the variability of 
response increases suddenly for mean 
perceived numbers higher tlian five. 
Garner used a similar task and pro- 
cedure, and reports the following con- 
clusions: (a) the duration (5-40 
msec) and intensity (up to 94 db) of 
the tone had no effect on counting 
accuracy; (6) the curve for a repeti- 
tion rate of 12 per second is almost 


identical with that for a 10 per second 
rate; and (c) there are very large in- 
dividual differences in counting ac- 
curacy. 

Garner has also reported in an- 


other study (1947b) dealing with 
threshold in relation to repetition 
rate, that as the repetition rate in- 
creases, the threshold decreases, for 
all frequencies used (250-4000 cps) 
although there is a break in the curve 
at rates between two and five tones 
per second. Garner notes that al- 
though the total energy in a stimulus 
is directly proportional to the repeti- 
tion rate, it has been shown that the 
ear does very little integrating of 
acoustic energy beyond a duration 
of 200 milliseconds (or five pulses per 
second). 

Two other studies are relevant to 
this problem. Mowbray and his co- 
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workers (Mowbray, Gebhard, & By- 
ham, 1956) used 10 interruption rates 
of white noise, ranging from 1 to 320 
bursts per second at an intensity of 
70-80 db. These pulse rates were 
given to the Ss as standards and were 
to be matched by a variable pulse 
rate. When the average deviation of 
the matchings were plotted against 
pulse rate on logarithmic coordinates, 
two functions were revealed, sug- 
gesting a change in the method of fre- 
quency discrimination at about five 
bursts per second. This was inter- 
preted as meaning that the listener is 
able to count the noise bursts from 
one to five per second but not above 
five. 

This interpretation is nearly but 
not quite consistent with the results 
of an experiment by Licklider and 
others (Licklider, Webster, & Hed- 
lun, 1950) dealing with the threshold 
for disappearance of binaural beats 
produced by pure tones. These ex- 
perimenters found that the subjective 
character of, beats changes in the 
range between 2 and 10 beats per 
second from periodic fluctuation to 
“roughness,” depending upon fre- 
quency. When Ss were asked to 
count the beats, the curve relating 
threshold to frequency was nearly a 
straight line at 8 beats per second. 
These studies, therefore, suggest that 
the counting ability of a human ob- 
server of tones, pulses, or beats has 
an upper limit definitely less than 10 
and probably less than 8 per second. 

In addition to these facts concern- 
ing the ability to discriminate pulse 
frequencies, there are several reports 
on the loudness of brief repetitive 
tones. Garner (194°) had six Ss 
equate the loudness © a series of re- 
peated short tones with the loudness 
of a steady tone of the same fre- 
quency using an 80 db intensity 
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level. His results showed the follow- 
ing: 

1. Aseries of pulses may be louder 
than a steady tone of the same in- 
tensity level. 

2. There is a maximum difference 
in loudness, in favor of the repeated 
short tones, at about 60 db. At 
higher and lower intensity levels, the 
relative advantage in loudness for 
pulse is less. 

3. The loudness of pulses is rela- 
tively highest between 1000 and 4000 
cps. 

4. The most consistent matching 
of loudness is shown at 5-10 pulses 
per second, although a decrease in 
pulse duration tends to increase vari- 
ability of judgments. 

5. Loudness increases much less 
rapidly than does intensity. 

In another related experiment deal- 
ing with binaural loudness matching 
with short tones, Garner (1947a) re- 
ported that the accuracy of loudness 
matching is best both below and 
above tone durations of 20 millisec- 
onds (with repetition rates of 1-10 
per second), and that repetition rates 
from 1-100 pulses/second had little 
effect on the variability of loudness 
matching. He notes as do many 
other investigators, that the differ- 
ences between individuals are much 
greater than the differences within 
individuals. 

In a study designed to study the 
loudness of noise rather than pure 
tones, Pollack (1941) reported find- 
ings somewhat similar to Garner's. 
He found that the interrupted noise 
sounds louder than a continuous 
noise of the same energy, being rela- 
tively greatest at rates of 2-10 pulses 
per second; and that the absolute 
threshold of a white noise (of con- 
stant sound-time fraction) increases 
as the frequency of interruptions in- 
creases from 1—300 pulses per second. 
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Pollack points out that the range of 
2-10 pulses per second is the range 
wherein there is enhancement of vis- 
ual brightness of a flickering light; it 
is also where the most acute intensity 
discrimination is found between two 
tones as a function of the difference 
in frequency between the tones. This 
same range also covers the alpha 
rhythm and its first subharmonic. 

In summary it may be concluded 
that humans have certain limitations 
with regard to the counting of rapid 
pulse signals. These limits include a 
range of about 1-12 pulses per sec- 
ond, a range in which certain interest- 
ing, threshold, loudness, and varia- 
bility data can be found. It may be 
stated as an hypothesis that such 
findings concerning auditory percep- 
tion may be related to the alpha 
rhythm and various visual phenom- 
ena associated with it. 


PHYSIOLOGICAL EFFECTS OF 
Loup SouNDs 


There are many published reports 
which indicate an effect of loud or 
repetitive sounds on physiological 


function. In his review of 1950, 
Kryter cites two earlier studies, one 
which found a rise in blood pressure 
following a loud unexpected sound 
(Lovell, 1941), and the other, which 
reported a decrease in peristaltic con- 
tractions and flow of gastric juices 
(Smith & Laird, 1930) following two 
10-minute periods of noise. 

More recently, Corso (1952) has 
stated that Ss exposed to intermit- 
tent noise (800—-10,000 cps) at 105 
db while taking an intelligence test 
showed a significant increase in pulse 
rate during the noise condition, and 
Stambaugh (1950) reported the same 
result. Davis (Davis, Buchwald, & 
Frankman, 1955) found that when 
auditory stimuli near the pain thresh- 
old in intensity (about 120-130 db) 
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are presented to an S at one-minute 
intervals, a complex pattern of re- 
sponses occurs, which includes in- 
creases in palmar-sweating, EMG, 
and respiration amplitude, decreases 
in pulse and finger volume and res- 
piration rate, and an increase and 
then a decrease in pulse rate. 

In addition, several studies have 
found a definite effect of loud sounds 
on muscle tensions. Davis (1948) 
used a 500 cps tone at an intensity 
level of about 90-100 db, presented 
to the Ss for two- or four-second in- 
tervals every two minutes. Although 
the Ss were instructed to do nothing 
in response to the stimuli, definite 
muscle tension responses were re- 
corded which were partially depend- 
ent in magnitude on the level and 
geographic distribution of tension 
already existing in the S before the 
stimuli were applied. In a further de- 
velopment of this study, Davis 
(1950) found that Ss made slight 
muscular responses to subliminal 
auditory stimuli as well as to supra- 
liminal sounds. In an extension of 
this type of experimental study to 
clinical patients, Malmo and his co- 
workers (Davis, Malmo, & Shagass, 
1954; Malmo, Shagass, & Davis, 
1950), (using 3-second, 1000 cycle 
tones at 80 db, 90 seconds apart) 
found that a group of psychoneurot- 
ics with extreme anxiety showed a 
greater and more long-lasting elec- 
tromyographic response from the 
forearm than a control group of nor- 
mal Ss and that arm muscles showed 
more significant differences than head 
muscles. There were, however, no 
significant differences between schiz- 
ophrenic and control Ss. 

It should be noted that in the 
studies by Davis there was some de- 
crease in the muscle tension responses 
with continued testing, although this 
was not reported in Malmo’s study. 
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In three investigations of the bio- 
logical effects of jet-engine noise, two 
reported positive effects, and one re- 
ported none. Parrack (Parrack, 
Eldridge, & Koster, 1948) exposed 
men to 10-minute periods of jet-en- 
gine noise at 120-150 db and to a 
1300 cps siren; the Ss reported heat- 
ing of the skin, vibration in parts of 
the body, muscular weakness and ex- 
cessive fatigue, all of which disap- 
peared within a week after cessation 
of tests. Allen, Frings, and Rudnick 
(1948) using 10-second exposures of 
a 20,000 cps tone at 160-165 db 
(re: 10~-'* watts per sq. cm.) reported 
that flies, mosquitoes, roaches and 
caterpillars were killed within min- 
utes, while human Ss developed skin 
burns, slight dizziness, and unusual 
fatigue. In contrast to these two re- 
ports, Finkle and Poppen (1948) re- 
ported no measurable effect on a wide 
variety of physiological measures of 
one-hour periods of exposure for 10 
days and two-hour periods for five 
days, to jet-engine noise at about 120 
db. The difference in results between 
this study and the preceding two 
may be due to the lower intensity 
noise and to the difference in the dis- 
tribution of energy in the different 
frequency ranges. 

Several other reports have been 
made of the effect of sound on physi- 
ological processes. Mendelson (1957) 
measured the auditory reflex in man 
as a minute pressure change in one 
ear canal as a function of the inten- 
sity (80-115 db) and frequency (400- 
8000 cps) of tones in the other. Seven 
out of 23 Ss showed a definite effect. 
Coleman and Krauskopf (1956) re- 
viewed the previous reports of an ef- 
fect of loud sounds on vision and indi- 
cated that their experiments, using 
110 and 140 db tones, revealed no 
consistent influences on visual thresh- 
old. They criticized the previous 
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findings as being based mainly on 
subjective impressions. 

In the Benox Report, Ward (1953) 
presented the results of observations 
of the electroencephalograph of one 
S while he was exposed to siren fre- 
quencies of 245-700 cps at 120-137 
db. When the earplugs were removed 
from one ear, there was a “striking 
and prompt abolition or desynchro- 
nization of the parieto-occipital alpha 
rhythm, at the same time good acti- 
vation of the EEG in the fronto- 
temporal region bilaterally was pres- 
ent, consisting of desychronization 
with an increase in the low potential, 
fact activity. When the eyes were 
open, the addition of the siren noise 
to one ear, added nothing to the 
alpha blockade already present. 
These EEG effects appeared to be- 
come less marked with repeated 
opening of the ear during any given 
run, suggesting the possibility that 
adaptation of central neural circuits 
can occur under these conditions.” 
It is unfortunate that no other Ss 
were tested since the two other 
studies which measured EEG’s (Fin- 
kle & Poppen, 1948; Mendelson & 
Conway, 1947) during exposure to 
jet-engine sounds report no, or equiv- 
ocal, changes in EEG recordings. 

There is another important area of 
research to which attention should be 
drawn, although it might seem quite 
peripheral; this is the work dealing 
with sound-precipitated convulsions 
in animals. The most recent review 
of the literature by Bevan (1955) cov- 
ering the period from 1947 to 1954, 
included 145 titles in the bibliog- 
raphy. The major findings of rele- 
vance here seem to be: 

1. Pure-tone experiments would 
indicate that the most effective fre- 
quencies to produce convulsions in 
rats and mice, lie above 8000 cps, 
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and the most effective intensities 
over 100 db. 

2. At least one author has found 
that an intermittent pure tone (9500 
cps) with a three-per-second inter- 
ruption rate (a rate similar to that of 
certain biolectric potentials in epi- 
lepsy) was most effective in produc- 
ing convulsions. 

3. Stimulation of other sense mo- 
dalities without sound does not seem 
to produce seizures. 

4. Many different variables, both 
in the Ss and in the environment, in- 
fluence the incidence and magnitude 
of seizures. 

In the light of some data to be pre- 
sented in succeeding sections, it is 
possible that the phenomenon of 
audiogenic seizure is one which is not 
limited to rats and mice, but that 
humans too may show dispositions 
toward such a response under certain 
conditions. 


PHYSIOLOGICAL EFFECTS OF 
REPETITIVE SOUNDS 


Relatively few studies have dealt 
directly with this problem although 
there are some which have great the- 
oretical value; these, largely by Lov- 
ett Doust and his co-workers will be 


described in some detail. One of 
their first papers appeared in the 
British journal, Nature (Lovett 
Doust, Hoenig, & Schneider, 1952). 
They reported that the use of a flick- 
ering light on 25 normal Ss at fre- 
quencies between 3 and 32 flashes per 
second produced marked changes in 
oxiometrically determined arterial 
blood oxygen-saturation values. 
Flicker rates between 3 and 9 per 
second, and also between 12 and 17 
per second produced a decrease of 
blood oxygen-saturation values, while 
this was normal at 9-11 flashes per 
second, and elevated at frequencies 
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of 18-22 per second. The comment 
was made in this paper that similar 
changes can he produced by replacing 
the flickering light by a relatively 
low-intensity auditory stimulus mod- 
ulated to the same frequency. 

In their next, more extensive paper 
(Lovett Doust & Schneider, 1952) the 
authors pointed out the rationale for 
such studies: 


Biological rhythms exist in a rich variety 
and almost bewildering profusion to attend 
and equilibrate the physiology of man. Such 
dynamic phasic activity appears not only to 
be intimately concerned with the phenomen- 
ology of life and biological processes in general, 
but is also to be found in purely chemical 
systems. Modalities of the periodicities asso- 
ciated with life can be divided into those ex- 
ternal to the organism—including diurnal and 
climatic variation, sun-spot activity, etc., and 
into those inherent within the individual such 
as the respiratory and cardiac rhythms, the 
menstrual cycle, sleep and awakening. Only 
less well marked are certain psychological 
periodicities such as ‘‘cyclothymic” variations 
in mood and personality. In the course of the 
present century much painstaking research 
has attempted to link external with internal 
rhythmic activities, significant correlations 
being adduced between seasonal variation 
and, for example, the incidence of psychiatric 
disorder, immunity from disease, tempera- 
ment and behavior, and an impressive array 
of biochemical and physiological variables 
ranging from blood pH, lactic acid and pro- 
tein to breath-holding time, plethysmography, 
tests of hand strength and fatigability, dark 
adaptation time and various tests of urinary 
function. 

Neurophysiologically, there is evidence 
both for chemical as well as for electrical pat- 
terns of periodic activity in the nervous sys- 
tem. Repetitive discharges with frequencies 
of 5-10 cycles per second from isolated stel- 
late ganglion preparations were found to be 
capable of change by variation in the ionic 
concentrations of the surrounding medium 
and Kaufman and Hoagland have findings 
suggesting a chemical pacemaker closely 
identified with cerebral respiration. The 
function of the diencephalon as a neural pace- 
maker has long. been postulated, and it ap- 
pears certain that the hypothalamic aggrega- 
tion of nuclei with its neural and endocrine 


145 


connections must play an outstanding role in 
emotional and awareness variations (p. 640). 


Using 109 Ss, 58 normal controls 
and 51 hospital patients, they re- 
ported data showing that the arterial 
oxygen saturation levels vary very 
consistently (as described above) for 
normal Ss regardless of the type of 
rhythmic stimulation used. (Photic, 
auditory, and cutaneous stimuli were 
used.) Maximum anoxemia (oxygen 
saturation decrements) occurred at 
5 and 15 pulses per second in the nor- 
mal Ss and they showed in addition a 
summative effect of simultaneous 
sonic and photic stimulation. The 
patient group showed similar but not 
identical stimulation profiles al- 
though some showed unpredictable 
variations in response to the different 
frequencies. 

“By depressing the oxygen levels 
by the choice of optimal stimulation 
frequencies, spontaneous comments 
by the healthy subjects revealed con- 
siderable changes in affect and levels 
of awareness, while, among the pa- 
tients, repressed unconscious ma- 
terial was brought into conscious- 
ness."’ Some of the spontaneous 
comments made during the anoxemia 
periods included: ‘‘concentration 
poor, feel slowed up; tired, drowsy, 
sleepy; irritable, annoyed, fed up; 
desire to stop machine or break it; 
headaches, dizziness, or giddiness.”’ 

The third paper of the series (Lov- 
ett Doust, 1953) discussed the appli- 
cations of these findings to an analy- 
sis of mental illness. Lovett Doust 
presented the hypothesis that an im- 
portant feature of mental disorder, 
at least in the physiological sphere of 
reference is a relative anoxemia, 
either in the resting state or accom- 
panying the dynamic response of 
psychiatric patients to stress situa- 
tions. The fourth paper of the series 
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(Lovett Doust & Schneider, 1954) 
presented some preliminary data ona 
new treatment for patients with psy- 
chiatric disorders based on rhythmic 
sensory stimulation, a method which 
Doust calls ‘“‘Rhythmic Sensory 
Bombardment Therapy” or R.S.B.T. 

Regardless of the actual effective- 
ness of rhythmic sensory stimulation 
for the treatment of the mentally 
ill, this work is very important in 
showing a basic similarity of physio- 
logical effect of repetitive stimuli in 
different sense modalities. From 
this point of view, it makes sense to 
examine the studies of visual flicker 
as a (possibly) reliable guide to the 
effects of auditory flicker. 

A visual phenomenon similar, in 
certain respects, to the one described 
by Lovett Doust, was discovered by 
Bartley in 1938. This is the fact of 
brightness enhancement, which is an 
increase in apparent brightness of a 
flickering light, at certain rates some- 


what below those that bring about 
fusion. This has been studied by us- 
ing two targets side by side—one 
steadily illuminated and the other 


intermittently. The intensity and 
rate of intermittency of the intermit- 
tent target are varied until the 
brightness of the two targets is 
matched. As the rate is reduced from 
the fwsion point less and less intensity 
is required for the intermittent tar- 
get to match the steady one. This 
continues until the pulse rate is in the 
neighborhood of 10 per second, the 
alpha rhythm of the human S. Here 
the intermittent target becomes 
about twice as effective as the steady 
one. With rates lower than this, the 
effectiveness of the intermittent tar- 
get declines again. It has also been 
found that a light-dark ratio of one- 
to-one produces maximum relative 
enhancement and that the pheno- 
menon occurs only with high inten- 
sity stimuli. 
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In addition to this Bartley effect, a 
flickering light produces many other 
results in human Ss. At an ancedotal 
level, many people have unpleasant 
effects of flicker produced by driving 
through a forest or thicket of regu- 
larly planted trees which consistently 
and sequentially interrupt natural 
light sources; similar effects have 
been observed when the S stands still 
and observes a light through a rapidly 
rotating object such as a fan. In re- 
cent years, flickering lights have 
been used as an aid to diagnosis of 
“petit mal’ epilepsy through ob- 
servation of changes in electroen- 
cephalographic pattern. Most recent 
of all is the interest in ‘‘photic driv- 
ing,’’ as a possible clue to the nature 
of (hypothetical) reverberating cir- 
cuits within the brain which are 
thought to be concerned with con- 
sciousness and perception. 

In a fairly extensive and recent 
study of the effects of flickering light 
on human Ss, Bach, Sperry, and Ray 
(1954) used high intensity light 
flashes at varying frequencies to de- 
termine the effects on subjective dis- 
comfort, tapping rates, walking, pur- 
suit rotor performance, and rifle fir- 
ing. Their results may be summa- 
rized as follows: 

1. Unpleasant subjective effects 
are consistently reported when Ss 
are exposed to diffuse flickering 
light. These effects can be grouped 
into four main categories: 


(A) Interference with conscious- 
ness—e.g.: blank mind, 
drowsy, dizzy, paralyzed 

(B) Sensations involving the eyes 

e.g.: fatigue, irritated, wa- 
tering 

(C) Sensations of muscle twitching 
—e.g.: eyes blinking, facial 
twitching, jumping of body 

(D) Sensations relating to other 
parts of body—e.g.: queasy 
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feeling in stomach, headache, 
nausea, chills, tense muscles. 
(Although the subjective re- 
ports in this experiment seem 
to have an unpleasant char- 
acter, there is at least one 
other study in which pleasant 
sensations were also reported, 
for example, sensations of 
warmth which may become 
quite pleasurable, or illusory 
conversations, the content of 
which could not be recalled 
(O’Flanagan, Timothy, & Gib- 
son, 1948). 


2. The most consistently effective 
flicker-frequency for the production 
of subjective effects is 9 flashes per 
second, although any particular fre- 
quency is not extremely critical be- 
tween the limits of 7 and 20 flashes 
per second. The effects are not cumu- 
lative with time of exposure beyond 
about five minutes, while maximum 
effects occur with high brightness of 
the field of view. Monochromatic 
light seems to hold no advantages 
over white light as indicated by pre- 
liminary tests with red, blue, and 
white light. Some degree of drowsi- 
ness was reported in all cases where 
the light was modulated by the 
spontaneous 9 cps EEG activity of 
the S. 

3. Hand-eye coordination was sig- 
nificantly impaired by a flickering 
light for one task (tapping) but not 
affected on another supposedly more 
complex tapping task. Rifle firing ac- 
curacy was significantly depressed 
(by approximately 50%) when a 6 
cps flickering light was placed behind 
the target at brightness levels of the 
order of magnitude of the scattered 
light of searchlight beams, but when 
the direction of the diffuse flickering 
light was toward the target, rifle fir- 
ing accuracy increased. Rate of walk- 
ing under conditions of diffuse flick- 
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ering light did not seem to be signifi- 
cantly affected, even in the presence 
of obstacles and with a continuously 
moving light source. 

This report therefore, as well as a 
host of previous studies (Ellingson, 
1956) indicates clearly the marked 
effects which high intensity intermit- 
tent stimuli may have on behavior, 
feelings, and on certain physiological 
measures. In fact it has been re- 
cently possible to develop an index 
of anxiety proneness by examination 
of the EEG reactions to photic stimu- 
lation. Ulett et al. (1953) tested 191 
Ss, both patients and controls, on 
various psychological tests and by 
interviews, and had them rated for 
anxiety proneness under stress. EEG 
records were then taken during con- 
ditions of resting and intermittent 
photic stimulation. The results indi- 
cated that there was a significant cor- 
relation between the psychological 
criteria of anxiety proneness and: the 
percentage of fast, slow and low am- 
plitude alpha in electronically an- 
alyzed resting EEG responses; the 
amount of harmonic responses in the 
20-30 cps range when flicker fre- 
quencies of one half or one fourth of 
this were used; and the amount of 
subjective dysphoria produced by 
photic stimuli. A check list of EEG 
anxiety indicators derived from this 
correlated +0.51 with the validating 
criteria of anxiety proneness. 

As far as could be determined 
there is only one study which has re- 
ported an attempt to “drive’’ brain 
rhythms by means of intermittent 
auditory stimulation. This study 
was described very briefly in an ab- 
stract of a talk to an EEG Congress 
by Goldman (1952): He reported 
that pure tones, which were inter- 
rupted in rhythmic fashion at rates 
of 1.5 to 25 per second, were used and 
that EEG changes appeared showing 
acoustic driving in temporal areas in 
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two out of eight cases. No other data 
were given and it is evident that repe- 
tition of this study would be interest- 
ing and fruitful. 


SUMMARY AND CONCLUSIONS 


This review has attempted to sum- 
marize and integrate a number of 
articles dealing with the effects of 
loud and intermittent sounds on 
human behavior, feeling, and physi- 
ology. Most of these studies have been 
published since 1950 when the last 
comprehensive review was written. 

Some of the more recent experi- 
ments demonstrate effects of very 
loud sounds on certain kinds of com- 
plex behavior particularly ‘‘clock- 
watching”’ and time estimation with 
the possibility implied that the dec- 
rement in performance may depend 
on the level of the sound as well as 
on its intermittency. 

With regard to the effects of high 
intensity or intermittent sound on 
feeling, the literature is almost unan- 
imous: high intensity noise, even 
when it may have no effect on per- 
formance will generally produce 
symptoms of discomfort, irritability, 
and distraction, although there is 
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little known about the relative an- 
noyance value of different kinds and 
levels of intermittency. 

Certain unique subjective charac- 
teristics of repetitive sounds are de- 
scribed relating to the effects of vari- 
ous rates of repetition on fusion, es- 
timation of pulse frequency, tonal 
character, threshold, and loudness. 
The greatest effects are usually 
obtained at repetition rates between 
5 and 10 pulses per second, a fre- 
quency range which coincides more 
or less closely with the alpha rhythm 
of the brain. 

With regard to the effects of loud 
or intermittent sounds on physiology, 
changes in blood pressure, gastric 
secretion, pulse rate, palmar sweat- 
ing, respiration, muscle tension, the 
electro-encephalogram, and_blood- 
oxygen saturation have been re- 
ported in various studies. 

Some theoretical concepts are pre- 
sented which postulate effects of 
auditory intermittency parallel to 
those of visual flicker. In general, the 
need for more research on the effects 
of intermittent sounds of various fre- 
quencies, repetition rates, and in- 
tensities is evident from this review. 
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Studies concerned with the dy- 
namogenic effects of anxiety, as 
measured by the Taylor Manifest 
Anxiety Scale (MAS), on learning 
and performance (e.g., Spence & 
Farber, 1953; Spence, Farber, & Mc- 
Fann, 1956; Taylor & Chapman, 
1955; Taylor & Spence, 1952) have 
typically regarded the MAS as a dis- 
criminating measure of generalized 
drive (D). Evidence for the validity of 
the test is considered present when- 
ever extreme groups, assigned on the 
basis of test score, perform in the 
direction hypothesized by related D 
theory. The particular theory in- 
volved has been treated in detail by 
Farber (1955), Taylor (1956), and 
Spence (1958). 


CONSTRUCT VALIDITY 
AND THE MAS 


Experimental validation of a test 
within a theoretical framework rep- 
resents an attempt at construct vali- 
dation. Basic to construct validation, 
as conceived by Cronbach and Meehl 
(1955), is the existence of a nomo- 
logical network that relates the con- 
struct to observables and to other 
constructs. Extending the validity of 
a construct would involve, according 
to Cronbach and Meehl (1955), 
i elaborating the nomological 
network in which it occurs, or of in- 
creasing the definiteness of the com- 
ponents” (p. 290). They further 
stated that the validation of a test 
claiming to measure a _ construct 
would require the existence of a 
nomological net surrounding that 
construct. The recent article by 
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Spence (1958) attempted, in fact, to 
describe the nomological net sur- 
rounding the construct of emotion- 
ally based D. 

In their criticism of the construct 
validity of the MAS, Jessor and 
Hammond (1957) questioned the ex- 
tensiveness to which various aspects 
of the nomological net surrounding D 
have been investigated experimen- 
tally and alternative inferences dis- 
confirmed. They accurately noted 
that MAS studies have concentrated 
specifically on the energizing com- 
ponent of D and have generally ig- 
nored other, perhaps equally impor- 
tant, aspects of the net. The Jessor 
and Hammond criticism is certainly 
germane not only to the MAS but to 
other scales purporting to measure D. 

The present writers, moreover, are 
inclined to the critical view that even 
those studies dealing with the dy- 
namogenic aspects of the net en- 
compassing D have, in general, con- 
tributed little to a systematic con- 
struct validation of the MAS and 
other D-oriented scales. The litera- 
ture in the area fairly abounds with 
studies yielding conflicting or equiv- 
ocal results. Experiments on serial 
and paired-associates learning, in 
particular, have been conspicuously 
contradictory in their findings. On 
one hand, Spence, Farber, and Mc- 
Fann (1956), Spence, Taylor and 
Ketchel (1956), Taylor and Chap- 
man (1955), Taylor and Spence 
(1955), and others (Montague, 1953; 
Ramond, 1953) have reported evi- 
dence consonant with the nomolo- 
gies present in the D net; on the 
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other hand, Saltz and Hoehn (1957), 
Katchmar, Ross, and Andrews 
(1958), Sarason (1957a, and 1957b). 
Hughes, Sprague, and Bendig (1954), 
and others have provided evidence 
against the expectations of the theory. 
The present writers contend that a 
major difficulty hampering and con- 
fusing research in this area has been 
the tendency to confound interac- 
tional effects within the nomological 
net with certain methodological prob- 
lems inherent in the research design. 
The express purpose of this paper is 
to point out some of these pertinent 
methodological problems and to sug- 
gest some possible ways of resolving 
them. 


METHODOLOGICAL PROBLEMS IN 
THE NATURE OF THE TASK 


According to the proponents of D 
theory, both D level and the relative 
strengths of competing responses 
must be accounted for in predicting 
the effect of D upon performance. 


Taylor (1956), for example, explicit- 
ly stated the position as follows: ‘In 


situations in which a number of 
competing response tendencies are 
evoked, only one of which is correct, 
the relative performance of high 
and low drive groups will depend 
upon the number and comparative 
strengths of the various response tend- 
encies” (p. 304). In simple learning 
situations such as conditioning where 
a single response tendency is to be 
acquired, the prediction from D 
theory is straightforward. High D 
groups are predicted to condition at 
a faster rate than low D groups. 
Considerable experimental evidence 
would seem to support this predic- 
tion based upon the MAS. In verbal 
learning, and especially paired-asso- 
ciates learning, the prediction would 
be based upon the amount of inter- 
ference or competition within the list. 
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D theory would predict that with lit- 
tle intratask interference a high D 
group should be superior to a low D 
group; however, on tasks involving 
considerable intratask competition, 
the low D group would be predicted 
to be superior. The validity of these 
nomologies within the D net, inci- 
dentally, has been questioned by 
Hill (1957) on purely theoretical 
grounds. 

Because of the D level-response 
competition interaction that has been 
postulated to operate within the 
nomological net, research relating D 
level as measured by D-oriented 
scales (chiefly the MAS) to per- 
formance on paired-associates tasks 
requires a “‘control’’ over the degree 
of competition within the task. The 
empirical results of such studies 
would then represent evidence for a 
construct validation of the scale. A 
crucial condition, therefore, in these 
validity studies is the degree to which 
E has controlled intratask interfer- 
ence. The typical procedure is to se- 
lect nonsense syllables or adjectives 
in such a way that the similarity of 
the material and the association val- 
ue of the material within the list can 
be conveniently manipulated. Simi- 
larity is usually manipulated by 
varying the letter content of items 
within the list, and association value 
is regulated by selecting syllables 
from Glaze’s (1928) or Hull’s (1933) 
tables with previously calibrated as- 
sociation values. By such manipu- 
lations, E derives a list which he con- 
siders representative of either a com- 
petitional or noncompetitional task. 
As Saltz and Hoehn (1957) point out, 
this procedure confounds amount of 
intratask competition with the diff- 
culty of the task. They contend that 
an increase in response competition is 
accompanied by an increase in diffi- 
culty of the list. They conducted sev- 











154 


eral studies in which they attempted 
to partial out difficulty from compe- 
tition. As a result, their findings 
failed to support D level-response 
competition theory. 

In manipulating lists of nonsense 
syllables or adjectives in terms of 
similarity and association value for 
purposes of varying intratask inter- 
ference and/or controlling for task 
difficulty, E seems to be operating 
under the basic premise that compe- 
tition within a list is independent of 
D as measured by his D-oriented 
scale. We seriously question this 
premise particularly as it applies to 
the association value of nonsense 
syllables and adjectives. The cali- 
brations of Glaze and Hull, it may 
be recalled, were based upon groups 
undifferentiated on any D dimen- 
sion. It does not seem unreasonable 
to suspect that performance on D 
scales and number of associations to 
nonsense syllables may have some 
covariance. If true, lists constructed 
from such calibrations would not be 
comparable lists in terms of compe- 
tition and difficulty for high and low 
D groups selected on the basis of 
scale score. Some adequate empirical 
demonstrations of independence be- 
tween these variables are clearly 
needed. 


METHODOLOGICAL PROBLEMS IN 
DRIVE MEASUREMENT 


Another important methodological 
problem encountered in construct 
validation studies on D-oriented 
scales pertains to the conditional defi- 
nition of D which the use of such 
scales involves. These difficulties 
have been discussed by Jessor and 
Hammond (1957). They concluded 
with the statement: ‘‘When a con- 
struct implies a relationship between 
variables, these variables must be 
designated independently of any test 
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of that relationship” (p. 169). The 
methodology commonly employed in 
studies involving D scales and com- 
plex learning tasks has departed 
grossly from this important qualifi- 
cation. The scale has been employed 
both to establish the validity of the 
construct (D) and simultaneously to 
establish the construct validity of 
the scale. Under such confounding 
dual purposes, failure of the data to 
fulfill the predicted outcome cannot 
be taken as substantive evidence for 
either an absence of construct valid- 
ity in the scale or for an incorrect 
nomological net. The results of such 
studies serve mainly to confuse and 
cloud the issue. 

As a way out of this protracted 
dilemma, the writers suggest that re- 
search in this area begin to utilize ex- 
perimentally induced D states as 
controls for evaluating the effects of 
response-inferred D states. We fur- 


ther recommend positive, nonemo- 
tional, approach drives in preference 


to negative, emotional, avoidance 
drives, such as shock or threat of 
shock, which are fraught with so 
many unsettled theoretical problems 
in their own right. The nomological 
net surrounding the positive drives is 
more clearly defined and has received 
wider empirical support than is gen- 
erally the case for the negatively 
based drives. 

Since experimentally induced D 
would be considered as contributing 
to generalized D, predictions based 
on this form of D should be compar- 
able to those based on D inferred 
from performance on a scale. For ex- 
ample, groups performing under high 
and low incentive conditions should 
be at least partially equated with 
high and low D groups selected on 
the basis of MAS scores. Predictions 
of this nature would extend beyond 
mere rate of acquisition in complex 
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learning and would include such di- 
verse phenomena as intentional ver- 
sus incidental learning, positive and 
negative transfer, extinction, etc. 
The kind of controls advocated here 
would not, of course, eliminate from 
consideration other interpretations 
pertaining to the construct validity of 
D scales. Predictions concerning 
emotionally based D upon an inter- 
fering response theory, as exempli- 
fied by Sarason and Mandler (Man- 
dler & Sarason, 1952; Sarason, Man- 
dler, & Craighill, 1952), or upon a 
habit theory, as exemplified by Hil- 
gard (1953) and Child (1954), would 
be difficult to test discriminately by 
any method. The procedure pro- 
pounded here has as its basic merit 
the elements for establishing a more 
well-defined baseline in an area be- 
sieged with many intricacies. 

Two alternatives are suggested 
here for providing the type of control 
discussed above. The first consists of 


a replication of a carefully designed, 


well-conceived experiment in which 
an experimentally induced positive 
D had been manipulated and clear- 
cut results obtained. The replication 
would consist of repeating the task 
and procedure as closely as possible 
with groups selected as high and low 
scorers, respectively, on the D scale 
to be validated. Such replication 
would require a preliminary demon- 
stration that competition within the 
material to be learned showed no co- 
variance with score on the scale. A 
study in preparation by the writers 
exemplifies this approach. It con- 
sisted of a replication of a study by 
Bahrick (1954) in terms of material 
learned and procedure followed, but 
was extended to a different D dimen- 
sion. Bahrick had demonstrated that 
a high incentive (financial reward) 
group displayed significantly greater 
intentional learning but significantly 
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less incidental learning than a low in- 
centive (no financial reward) group. 
The learning task consisted of the 
serial learning of geometric forms. 
Since the nature of the task is one 
that probably involves little intra- 
task competition, it should differ- 
entiate high and low D groups se- 
lected by MAS score in a similar 
manner. That is, D theory would 
predict that high scorers on the 
MAS should show greater intentional 
learning but less incidental learning 
than low scorers. 

We found, as in the Bahrick study, 
that the high D group (now identi- 
fied by MAS score), comparable in in- 
telligence and sex distribution to the 
low D group, displayed a significantly 
higher rate of intentional learning. 
Unlike the Bahrick study, however, 
the low D group did not display su- 
perior incidental learning. The re- 
sults would therefore seem to conflict 
with the finding by Silverman and 
Blitz (1956) that high scorers on the 
MAS showed significantly less inci- 
dental learning on a serial list of non- 
sense syllables with low association 
values than low scorers. Silverman 
and Blitz interpreted their findings 
in terms of interfering responses cor- 
relating with anxiety. Unfortunate- 
ly, the lack of information concerning 
the comparability of the list of syl- 
lables for the two groups and the lack 
of control groups performing on the 
same task under more operationally 
defined D conditions make it difficult 
to evaluate such results as either 
supporting or rejecting the construct 
validity of the MAS. 

The second suggested alternative 
follows in a natural vein from the 
first. Studies that are directed at 
the construct validity of D scales 
should include within their research 
design, whenever possible, compa- 
rable groups performing the same task 
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as the experimental groups but under 
varying levels of an experimentally 
induced approach D. The perform- 
ance of such control groups would 
then provide a more adequate basis 
for evaluating the effects of D in- 
ferred from scores on a D scale. A 
recent study by Katchmar, Ross, and 
Andrews (1958) illustrates this point 
nicely. They compared rate of learn- 
ing for a coding task between groups 
differentiated in D level in terms of 
MAS anxiety, ego involvement, and 
failure-induced stress. Their design 
is certainly excellent as far as making 
relative comparisons between these 
three variations of emotionally based 
D. In their discussion, however, they 
state the results for high and low 
MAS scorers do not support the theo- 
retical formulation of Taylor and 
Spence. Thus they are inferring that 
the study does not support the con- 
struct validity of the MAS. In our 
opinion, a more adequate test of the 
construct validity of the MAS in 


their study would require two addi- 
tional control groups, consisting of 
more clearly defined high and low D 
groups, where D is manipulated ex- 
perimentally in a nonemotional, posi- 
tive, approach direction. 
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SUMMARY 


Experimental studies directed at 
establishing the construct validity of 
D-oriented scales, such as the Taylor 
MAS, are beset with theoretical and 
methodological problems that make 
it difficult to interpret their results. 
This is particularly true in studies 
that relate response-inferred, emo- 
tionally based D to verbal learning. 
As a partial answer to these prob- 
lems, the writers contend that studies 
employing verbal tasks should re- 
quire a prior demonstration of com- 
parability of the task for extreme 
groups identified by D scale perform- 
ance. This need is dictated by the 
emphasis placed on the interaction 
between D level and intratask com- 
petition in contemporary D theory. 
The writers further contend that re- 
search in this area requires informa- 
tion collected from control groups 
performing on the same task as the 
D scale groups under high and low D 
conditions that represent clearly de- 
fined, experimentally induced moti- 
vational states. The information 
thus provided would serve as a base- 
line for evaluating the evidence for or 
against the construct validity of the 
D-oriented scale. 
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ON THE CLASSIFICATION OF PROJECTIVE TECHNIQUES! 
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Classification is a menial task! It is 
generally considered to be tedious, 
unexciting, and an activity, at least 
for American psychologists, to be 
delegated to someone else. In spite 
of this deficiency in allure, and the 
general tendency of most psycholo- 
gists to prefer the immediately more 
rewarding activities of experimental 
analysis or broad theoretical formula- 
tion, there seems little doubt that 
some emphasis upon classification is 
important in every branch of psy- 
chology. The concern of the present 
paper is with a relatively humble 
problem of taxonomy—the classifica- 
tion of projective techniques. 

I shall summarize briefly a num- 
ber of approaches to this problem 
and suggest a basis for classification 
that seems to me superior to the vari- 
ous alternatives. Such an enterprise 
is obviously of, interest to those who 
teach or theorize about projective 
techniques, for some kind of order 
must be imposed upon this diverse 
array of instruments if they are to be 
discussed efficiently and intelligently. 
Moreover, a classification that can 
be agreed upon, and that seems to 
make psychological sense, should 
serve some function in research and 


1 This paper is an outgrowth of a mono- 
graph on the use of projective techniques in 
social research initiated by the former Com- 
mittee on Social Behavior of the Social Science 
Research Council. Preparation of the article 
was supported by Research Grant M-1949 
from the National Institute of Mental Health 
of the National Institutes of Health, Public 
Health Service. I am grateful to my colleague 
Ephraim Rosen for his suggestions and to 
Arthur Hill for assistance in preparing the 
tabular material. 
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applied settings, where the interrela- 
tion between various projective tech- 
niques frequently appears to be a 
matter of vast importance. 

Before turning to a definite pro- 
posal in regard to classification, let 
us consider some of the suggestions 
that have been made in the past. A 
number of prior observers have con- 
cerned themselves with the ways in 
which projective techniques can be 
clustered or grouped, but probably 
the first and best known of such at- 
tempts is contained in the pioneer 
article by Frank (1939), recommend- 
ing the use of projective techniques. 
He suggested that these instruments 
could be distinguished in terms of 
whether the responses they elicited 
were constitutive, interpretative, ca- 
thartic, or constructive. The test 
may be considered constitutive if the 
S is required to provide a structure or 
form for relatively unstructured or 
ambiguous stimuli, such as finger 
paints or Rorschach cards. When 
the S is asked to indicate what the 
meaning of the stimulus is to him, for 
example, if he is asked to assign 
meaning to a picture, the test is con- 
sidered to be interpretative. The 
cathartic test involves some delib- 
erate attempt to induce the S to ex- 
press or release emotion in the process 
of reacting to the stimuli, as in the 
case of doll play or psychodrama. If 
the S is required to build or organize 
stimulus materials, such as blocks or 
toys, in such a manner as to reveal 
“some of the organizing conceptions 
of his life” (p. 403) the test is labelled 
constructive. In a subsequent formu- 
lation Frank (1948) has added the 
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category of refractive techniques. 
These are devices that depend upon 
error or distortion in the S's judg- 
ment of some set of stimuli, and are 
typified by instruments involving 
tachistoscopic presentation of stimuli 
related to a given motive or conflict. 

Helen Sargent (1945) has consid- 
ered the problem of classifying pro- 
jective techniques somewhat more 
broadly, suggesting that these de- 
vices may be grouped in terms of: 
(a) the nature of the materials pre- 
sented to the Ss; (6) the functional 
use the S makes of the materials; (c) 
the method of presentation used by 
the examiner; and (d) the purpose 
for which the test is employed. She 
suggests that under the heading of 
materials, the tests may be distin- 
guished in terms of whether they in- 
volve presentation of ink blots, pic- 
tures, stories, art media or sounds. 
The categories proposed under the 
heading of functional uses employed 
by the S are the same as those ini- 
tially recommended by Frank 
stitutive, interpretative, cathartic, 
and constructive. In distinguishing 
between tests on the basis of method 
or technique of presentation, Sargent 
considers variations in both presenta- 
tion and interpretation. The main 
distinctions in presentation she dis- 
cusses have to do with the degree of 
standardization, or “experimental 
control,” that is imposed upon the 
examiner and S. Differences in in- 
terpretive approach that she identi- 
fies have to do with an emphasis upon 
empirical origin, as opposed to the- 
oretical derivation, of the interpreta- 
tive system. She suggests distin- 
guishing the tests in purpose accord- 
ing to whether they are used princi- 
pally for diagnosis, therapy, or ex- 
periment. 

In a discussion of the indirect 
measurement of attitudes, Campbell 
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(1950) suggests three classificatory 
principles that have implications for 
projective techniques. First of all, 
there is the question of whether the 
device is disguised or not, that is, 
whether the S can estimate accu- 
rately the intent of the examiner. This 
dimension might appear to have no 
utility in the present context, as vir- 
tually all projective techniques are 
assumed to be disguised, but there is 
actually a moderate degree of varia- 
tion among projective techniques 
along this dimension. Second is the 
question of whether the instrument 
is structured or not. Campbell ap- 
pears to use this term to refer to both 
the ambiguity of the stimulus and 
the amount of freedom permitted the 
S in determining how he will respond. 
Again these are qualities that are in- 
volved in the differentiation of pro- 
jective techniques from other per- 
sonality devices. Nevertheless, there 
is considerable variation between 
projective techniques along these 
dimensions and they might well pro- 
vide the basis for important classifi- 
catory distinctions. Third is the dis- 
tinction between “voluntary self-de- 
scription as opposed to diagnosis based 
upon differential performance in an 
objective task’ (p. 15). Virtually no 
projective technique can be consid- 
ered to depend upon “voluntary self 
description” so that this dimension is 
of only passing interest in its present 
form. 

Recently, Campbell (1957) has 
presented a revision of this analysis 
which is intended tu refer more spe- 
cifically to projective techniques. 
The new formulation includes three 
polar dimensions: voluntary versus ob- 
jective (Is the S to report something 
accurately or is he to provide his 
“own” or “‘first’’ response, without 
regard for correctness?) ; indirect ver- 
sus direct (Does the S know the pur- 








160 


pose of the test?); and free-response 
versus structured (Can the S respond 
much as he chooses or must he select 
from a limited array of alternatives?). 
These dimensions are then combined 
to describe various kinds of psycho- 
logical tests and Campbell considers 
examples of the resultant types. He 
concludes that most projective tech- 
niques are voluntary, indirect, and 
free-response but some projective 
techniques can be described as volun- 
tary, indirect, and structured; volun- 
tary, direct, and free-response; objec- 
tive, indirect, and free-response; and 
objective, indirect, and structured. 

Cattell (1951) has suggested that 
the fundamental process involved in 
projective tests is not projection but 
misperception and that these devices 
should, consequently, be called ‘‘mis- 
perception techniques.’ —Further- 
more, he has indicated that such de- 
vices may be divided into four differ- 
ent classes depending upon the form 
of misperception that operates. First, 
there is the instrument that depends 
on naive misperception, where the S is 
unable to recognize the fact that 
others feel and think differently than 
he does and, as a result, generalizes 
his own perceptions to everyone else. 
Second, there is the test that utilizes 
the process of autism, where the S 
modifies or distorts his perception in 
such a manner as to satisfy or reduce 
his needs and desires. Third are the 
instruments that involve press com- 
patibility misperception, where the S 
views the environment as existing in 
such a form as to fit, or make reason- 
able, his motives and affective states. 
Fourth are the devices that depend 
upon ego defense misperception where 
the distortion in perception takes 
place at the service of unconscious 
and repressed motives, in a form de- 
termined by the various mechanisms 
of defense. 
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There are many additional distinc- 
tions between projective techniques 
that can be proposed. For example, 
we may point to the differences be- 
tween structural or formal techniques, 
as opposed to “‘meaning’’ or content 
techniques. Here the distinction has 
to do with whether, in interpreting 
the test, the focus of the examiner is 
upon the way in which the task is 
performed—the speed and quantity 
of response, the relative frequency of 
certain types of words, the tendency 
to respond to all or to part of the 
stimulus, etc.—or upon the meaning- 
ful outcome of the performance. The 
formal device is concerned with cer- 
tain quantifiable aspects of the re- 
spondent’s general pattern of re- 
sponse, and there is little or no in- 
terest in the content or meaning of 
what the respondent is saying or do- 
ing. The Rorschach technique is 
usually considered to be primarily a 
formal test, although there is consid- 
erable evidence for a shift in recent 
years toward more extensive use of 
content in interpretation. If the in- 
terpretation is focussed upon what the 
individual says or does and its mean- 
ing, or the thematic connection be- 
tween various response elements, the 
instrument would be classified as a 
content technique. Illustrative of 
this type of instrument is the cus- 
tomary use of the Thematic Apper- 
ception Test. 

Further, we might distinguish be- 
tween those tests that are admin- 
istered individually, as opposed to 
those that are capable of group ad- 
ministration. Actually, this is a difh- 
cult distinction to maintain, for as 
soon as someone develops an indi- 
vidual technique that seems to pos- 
sess utility, there are certain to be a 
number of investigators eagerly seek- 
ing to adapt the technique for group 
administration. Nevertheless, at any 
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given point in time it is possible to 
distinguish between tests in terms of 
how readily they can be adapted to 
meet the demands of group admin- 
istration. For example, the sentence 
completion test’ can be given in 
group settings very readily, while 
doll play or word association tech- 
iiques are considerably more difficult 
to administer outside of the indi- 
vidual session. 

One might classify projective de- 
vices in terms of the sense modality 
involved. For example, there are 
visual, auditory, and even tactual 
stimuli employed in tests now in use. 
An additional distinction between 
these devices can be made in terms 
of the degree of response multiplicity 
permitted by the technique. There 


are a few techniques that require the 
S to choose between a small num- 
ber of specified alternatives, for ex- 
ample, the Szondi Test; while others 
permit a theoretically (and almost 


practically) limitless number of re- 
sponses, as in the TAT. 

Similar to one of the distinctions 
proposed by Sargent, is the difference 
between rational and empirical tests. 
On the one hand, we have techniques 
where there is no rationale provided 
for the fact that a given type of re- 
sponse seems to be associated with a 
given personality characteristic. Nor 
is the individual who develops such 
an instrument concerned with this 
state of affairs. As long as there is a 
firm association between a particular 
type of test response and a given per- 
sonality variable, he believes that the 
test may be used in a dependable and 
useful fashion. The extreme of this 
approach implies only an interest in 
the empirical regularity, with no con- 
cern for underlying processes or in- 
termediary factors. On the other 
hand, we have techniques where 
there is a reasonably careful attempt 
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to provide a prior rational or theoret- 
ical basis to justify the use of a par- 
ticular response element as diag- 
nostic of a particular personality at- 
tribute. In practice, it is clear that all 
devices represent mixtures of these 
two extremes. The individual who 
professes to be disinterested in theory 
and concerned solely with empirical 
association must make some decision 
about where to look for his empirical 
regularity, and here he obviously 
must drag in some “theory” or prior 
assumptions. On the other hand, the 
individual who is interested in a prior 
rationale, if he is sophisticated, must 
show considerable curiosity about 
whether his theoretically predicted 
relationship is, in fact, sustained in 
the world of reality and, thus, he in- 
troduces the crass empirical criterion. 
In spite of this overlap, many instru- 
ments appear to be more heavily in- 
fluenced by prior theorizing than 
others. In general, the Blacky Pic- 
tures have developed with a close 
relationship to explicit theory, while 
the Rorschach, during at least much 
of its development, seems to have 
been treated as an empirical device. 

The potential systems of classifica- 
tion we have considered are by no 
means exhaustive but they serve to 
illustrate the rich variety that offers 
itself to the person who surveys this 
area. How to choose between all of 
these alternatives? Perhaps an 
answer to this question can be pro- 
vided by attempting to classify the 
classifications. That is, if all the prin- 
ciples of classification can be grouped 
together, it may be possible to select 
from among them the avenue that 
seems most fruitful. 

From what has already been said, 
it is clearly possible to distinguish be- 
tween six different approaches to the 
classification of projective tech- 
niques. First, there is the distinction 








162 


based upon attributes that inhere in 
the test material itself. Here we are 
concerned with variation in the 
stimulus material, for example, struc- 
tured versus unstructured or audi- 
tory versus visual. Second, we may 
classify the tests in terms of the 
method by which the technique was de- 
vised or constructed, for example, the 
distinction between rational and em- 
pirical techniques. Third, we may 
distinguish between these devices on 
the basis of the manner in which the 
test is interpreted, for example, formal 
analysis versus content analysis, or 
“sign’’ interpretation versus holistic 
interpretation. Fourth, we might 
propose a classification that is based 
upon the purpose of the test, for ex- 
ample, the assessment of conflict as 
opposed to the measurement of mo- 
tives, or the general description of 
personality as opposed to the estima- 
tion of specified dimensions of per- 
sonality. Fifth, we might propose a 
set of categories that are concerned 
with differences in the administration 
of the test, for example, group tech- 
nique as opposed to individual tech- 
nique, or self-administered versus ex- 
aminer-administered. Sixth, we can 
distinguish between the instruments 
on the basis of the type of response 
they elicit from the S, for example, 
story construction as opposed to as- 
sociation. 

All of these distinctions have some 
usefulness and something can be said 
in favor of each of them as providing 
the best means for classifying projec- 
tive techniques. In spite of this, | 
would argue that the final type of 
classification, the one based upon 
differences in type of response, is 
easily the most important and the 
one that merits emphasis. The essen- 
tial consideration here is that this 
classification seems most likely to be 
closely related to the underlying psy- 
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chological processes involved in the 
various tests, for it is this classifica- 
tion that points to what the S is ac- 
tually doing. In so far as these tests 
are distinctive, and to be treated as 
significantly different, it seems likely 
that the major determinant of this 
distinctiveness will be the differences 
in what the S is actually engaged in 
as he responds. It is also worth note 
that a number of the other types of 
classification are more or less directly 
specified by distinctions based upon 
mode of response, for example, if the 
technique elicits choice responses, we 
know a good deal about whether it 
will emphasize formal or content 
analysis, whether it is likely to be 
capable of group administration, and 
whether it will be structured or not. 

Even if we agree that distinctions 
between projective techniques based 
upon variation in the type of response 
elicited from the S are most impor- 
tant, there is still the task of arriving 
at just the proper array of such dis- 
tinctions. For most purposes it 
seems sufficient to think in terms of 
five general types of response. These 
are: (a) association, (b) construction, 
(c) completion, (d) choice or order- 
ing, and (e) expression. Obviously, 
not every test can be fitted neatly 
into only one of these categories. 
There is the usual overlap and am- 
biguity in the world of reality. How- 
ever, with very little effort it is possi- 
ble to classify virtually every projec- 
tive technique as involving predom- 
inantly one of these types of re- 
sponses. More significant is the fact 
that when projective techniques are 
classified on this basis, we find that 
the instruments brought into the 
same category have a general congru- 
ence and psychological consistency 
that makes it easily possible to con- 
ceive of similar underlying psycho- 
logical processes. 
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The reader will note certain sim- 
ilarities between the present cate- 
gories and those proposed by Frank 
(1948). However, Frank’s categories 
do not consistently refer to the na- 
ture of the S’s response, for example, 
the refractive distinction points to 
the interpretive process; and, in some 
cases, the distinction implied by his 
labels does not seem empirically 
clear, for example, the distinction be- 
tween interpretive, constitutive, and 
constructive is by no means evident. 
Most important is the fact that his 
categories do not produce the same 
clusters of instruments that the pres- 
ent classification generates. 

Let us characterize very briefly 
each of these types of projective 
techniques and indicate, in an illus- 
trative manner, the individual tests 
that would be included under each 
heading. First of all, are the associa- 
tive techniques. Here the S is set to 
respond to some stimulus presented 
by the examiner with the first word, 
image or percept that occurs to him. 
Such devices minimize ideation and 
emphasize immediacy. The S is not 
to reflect or reason but, rather, to 
respond with whatever concept or 
word, however unreasonable, first 
rises to consciousness, or occurs to 
him. 

These techniques, in certain re- 
spects, represent a bridge between ex- 
periment and the clinical setting, for 
in both areas there have been exten- 
sive studies of what happens when an 
individual is asked to respond to some 
stimulus with the first association 
that comes to his mind. It was nat- 
ural that students of the normal, 
conscious, human mind should use 
this device as a means of mapping, 
or laying bare, the structure of 
mental events. Further, once Freud 
had devised the method of free asso- 
ciation this approach was accepted 
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as an important means of gaining 
insight into the hidden reaches of the 
mind. It is not surprising, therefore, 
that a number of important tech- 
niques embodying this mode of re- 
sponse have been developed, the most 
popular of which are the Word Asso- 
ciation Test and the Rorschach test. 
Also typical of this type of instru- 
ment are Stern’s Cloud Test and cer- 
tain auditory projective tests. 

Second are the construction tech- 
niques. Here we find a group of in- 
struments that require the S to create 
or construct a product which is typi- 
cally an art form such as a story or 
picture. There is a minimum of re- 
striction placed upon the S’s re- 
sponses and in some cases, such as the 
blank card of the Thematic Apper- 
ception Test, even the original stim- 
ulus is under little control by the ex- 
aminer. 

The focus of this type of instru- 
ment is upon the outcome, or prod- 
uct, constructed by the S and not 
upon his behavior or style in the 
process of creating or responding. 
The S is set to provide a product that 
is meaningful and personally relevant 
to the eliciting stimuli. The response 
process may begin with simple asso- 
ciation, but the requirements of these 
tests force the S to modify and elab- 
orate the original association, so as to 
satisfy normative requirements for 
what constitutes a story or other art 
form. Unlike the associative tech- 
niques, these instruments require the 
S to engage in complex, cognitive ac- 
tivities beyond mere association. II- 
lustrative of these devices are the 
Thematic Apperception Test, the 
Blacky Pictures, and the Make-A- 
Picture-Story Test. 

Third, we find the completion tech- 
niques. These measures provide the 
S with some type of incomplete prod- 
uct and the requirement that he com- 

















164 


plete it in any manner he wishes. 
They differ from the associative tech- 
niques in that both the stimulus and 
the response are typically much more 
complex and thus the response is less 
immediate. Furthermore, the com- 
pleted product is usually expected to 
meet certain external standards of 
good form or rationality, e.g., there 
are rules about what constitutes a 
sentence or a story and they presum- 
ably operate to determine the S’s 
completions. When compared to the 
construction techniques, the re- 
sponses elicited by these instruments 
are generally simpler and more re- 
stricted. The best known example 
of this type of instrument is the 
Sentence Completion Test, but 
equally typical are the Picture Frus- 
tration Study and argument comple- 
tion and story completion tech- 
niques. 

Fourth are choice or ordering tech- 
niques. These instruments resemble 
the associative measures in the sim- 
plicity of the response set provided 
for the Ss. Here the respondent 
merely chooses from a number of 
alternatives the item or arrangement 
that fits some specified criterion such 
as correctness, relevance, attractive- 
ness, or repugnance. In some cases, 
such as the multiple choice Rorschach 
and TAT, these devices mirror other 
techniques except that the S is asked 
not to produce an association or a 
construction but rather to select from 
a number of hypothetical responses 
the one that seems most appropriate 
to him. The two tests that provide 
the most effective illustration of this 
category are the Szondi Test and the 
Picture Arrangement Test. 

Fifth are the expressive techniques. 
As a class, these methods represent a 
bridge between the diagnostic and 
therapeutic, for all of them play an 
active role in current therapeutic 
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practice. It is presumed for these 
measures that the S not only reveals 
himself, but also that he expresses 
himself in such a manner as to influ- 
ence his personal economy or adjust- 
ment. Typically these instruments, 
as in the case of the constructive 
techniques, require the S to combine 
or incorporate stimuli into some kind 
of a novel production. Unlike the 
constructive techniques, however, 
there is as much emphasis upon the 
manner or style in which the product 
is created, as upon the production it- 
self. In other words, the chief dis- 
tinction between these measures and 
constructive devices is the assump- 
tion of therapeutic efficacy, and the 
greater emphasis here upon the style 
or manner in which the constructive 
process is carried out. Typical of 
these instruments are play _tech- 
niques and drawing and painting 
techniques, as well as psychodrama 
and role playing devices. 

So much for this simple classifica- 
tion. It is evident that the person 
who wishes a more complex basis for 
differentiating projective techniques 
can readily introduce additional di- 
mensions. Thus, if we return to the 
six types of classification mentioned 
earlier, we can easily construct a 
typology of the sort represented in 
Table 1. In this table each test is 
represented by a double plus (++) 
in the rows that are fully appropriate 
or descriptive of the test and by a 
single plus (+) in the rows that are 
only partially or incompletely appro- 
priate. Thus, for the Rorschach test 
a double plus (++) in the “formal” 
row, and a single plus (+) in the 
“content”? row, indicates that both 
formal and content approaches are 
used in interpreting the Rorschach, 
but with greater emphasis placed 
upon formal variables. The reader 
should note that we have rated these 
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tests according to their typical or 
most highly developed use, in order 
to maximize differentiation. There- 
fore, in rating ‘‘mode of response”’ the 
Rorschach is assigned a double plus 
(++) for the ‘‘associative’’ row, and 
no rating for the “choice or ordering”’ 
row, in spite of the fact that the test 
is sometimes given in group forms 
that involve choice or ordering. It 
must be admitted that not every one 
of the judgments registered in this 
table would meet with unanimous 
approval among other psychologists, 
but the majority of the decisions are 
quite evident and with a little further 
definition of terms, and specification 
of standards of judgment, would be 
made consistently by most trained 
psychologists. 

Thus, we are able to construct a 
profile for each projective test, repre- 
senting its classification according to 
a variety of criteria. Moreover, if we 
wish, we can readily compute coeffi- 
cients of similarity or discrepancy to 
indicate the amount of association 
between the various instruments on 
these ratings. Illustrative of such an 
approach is Table 2 which presents a 
matrix of deviation coefficients (Os- 
good’s and Suci’s D index [1945}) 
that estimate the degree of associa- 
tion between the instruments as they 
were rated in Table 1. The reader 
should note that computation of 
these indices involved the minor sin 
of overlooking the fact that our rated 
dimensions are not orthogonal. 

A further step in the analysis is 
represented in Table 3 where we find 
illustrative clusters of techniques 
that seem to be similar to each other 
in their profiles. All of the tests in- 
cluded in the same cluster are linked 
by a D index that would place them 
in the lower quartile (most similar) 
of the 55 indices presented in Table 
2. The clusters are presented in order 
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of decreasing internal homogeneity. 

How meaningful are these clusters, 
and how do they compare with the 
groupings based on our classification 
according to type of response? To 
begin with, it should be noted that 
three of the clusters (Sentence Com- 
pletion Test and P-F Study; TAT, 
MAPS, and Blacky Pictures; Szondi 
and Picture Arrangement Test) are 
identical with the clusters that would 
have been derived from the single cri- 
terion of “type of response.”” Fur- 
thermore, it is clear that according 
to this matrix of indices the Word 
Association Test is the most indi- 
vidual of all the instruments, for its 
lowest D index is appreciably higher 
than those indices linking the tests 
that we have clustered together. 
Finally, it turns out that the Ror- 
schach test is singularly difficult to 
classify. It is clustered with drawing 
(painting) techniques and_ psycho- 
drama on the basis of low D indices, 
but it also shows considerable sim- 
ilarity to the MAPS Test and the 
TAT, although not with the Blacky 
Pictures. All in all, the classification 
that emerges from this somewhat 
tedious and difficult method of anal- 
ysis bears a strong resemblance to 
the simpler classification we have pre- 
sented, and at the same time it repre- 
sents a somewhat blurred outcome. 
There seems little basis at this point 
for concluding that there is any su- 
periority to such an approach. 

To summarize, this paper has 
pointed to the importance of estab- 
lishing some consistent basis for clas- 
sifying projective techniques, and has 
considered a number of possible ap- 
proaches to this problem. A classifi- 
cation based upon the mode of re- 
sponse elicited from the S was identi- 
fied as most promising, and it was 
suggested that projective technique 
responses can be divided meaning- 
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TABLE 3 


PROJECTIVE TECHNIQUES CLUSTERED ACCORDING TO PROFILE SIMILARITY 


Cluster 1 Cluster 2 Cluster 3 Cluster 4 Cluster 5 
Sentence-Comple- Thematic Apper- Rorschach Szondi Test Word Association 
tion Test ception Test Test 
P-F Study Make-A-Picture- Drawing and Paint- Picture Arrange- 
Story Test ing Techniques ment Test 


Blacky Pictures Psychodrama 


fully according to whether they in- plex (multidimensional) taxonomy 
volve: association, construction, com- there seemed to be little basis for pre- 
pletion, choice (ordering), or expres- ferring the more cumbersome method 
sion. Moreover, when this classifica- of classification. 

tion was compared with a more com- 
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